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Hi. pr^i mvenlion relm to systems ^ „«h„b for i„t«,ctiva, se«chi,« a 
| d,..basc in such a , ^ a,^ ^ ^„ ^^^^^ 

• : V S*'''*' '^'^ '^^O' info-ma'ion usbg ™.Wpte 

■ ilso^totcs ,0 business m«ho* associated with p„vidi,g M.n.^r, ,o users based on tt. 
■ ^■■'f-''<=^t^m>«n,^6:r,^^0^^ ^^^^ 

■ taveota also relates .0 [delete: building and m««^^^ 
, , ;^;^fabase based on. content aggregation, managcment.and distribution. 



■ , •^P'«"'™ve„«o»«*r«.t«i.o systems and metlKKi, for quickly and efHci«^^ 
■^Wevinginforn^onfhm.oonecdonofdataordatab^^^^ For putposes of example, the 
, |?5mc. is teparagonofacoliection of data from which i, is ditfieult^efflci^^^^^ 
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!: j,?;M|sired data. But it will be appreciated that the present invention is applicable to any 



; ' v;'"^''^: There is currently more information floating around than at any time in our history, 
'•^^^ ^Slft fo exists in the niillions upon millions of books, documents, records, libraries, 

' 5 '^ t^^rchiyes, di databases, and catalogs that individuals all must use to work, live, arul 

v ■ ' f i^^ with other human beings. But while there is more information available and more 
Ayays to access it than ever before, finding information individuals need when they need it still 
: V ■ r . ^^^^ most challenging, time-consuming, and frustrating experiences of life in 

'''f0^ 'X \ From the earli^t conception of the Internet until the present time, one of the 
' :^^ljaHenges fi^ anyone seeking to use the Internet is figuring out how to find a specific. 



' ' -kJ' ^^l^^ively^mall^amount of inforaiation from among the vast amount available on the Internet 



i>: * whole industry is devdted td the developmeriTt of better ways and means to help 

' • V i^!?^ do just tfiat. One'such group of developments is search engines. Search engines allow 
.1^' / , psers to type in a tenm and receive back a laundry list of Web sites that are associated with 
• /that term. 

■ The act of accessing the Internet to obtain or find information has come to be called 

> the lnternet^dr "surfing the Web" which is directed at a very popular part of the 

: ke^ the Wodd Wide Web ("Web" for short). When a person initiates a "search" on the 
; ,Web he or she attempts to find infomiation using one or more methods presently at their 
j1v:J^||!?P^^^^'^^^'^^^^ conducting Intern^ searches have been implemented. 

/ f^^^^^ methods suffer firom a variety of shortcomings. 

> ; ■ f Figure 1 is a visual representation of a database 1. This database 1 is made up of a 
. plurality of records 2. Each record may consist of a single character, a string of characters, a 
iS^i^,;^P^^^ strings of characters, an image, an audio file or any combination of the preceding, 

V.'. 2 

' ..Vll^li-;-.-.,,- : 

. •:. ./.vJiV.j,.,-.' ;'■ 



r^ilie size of the database 1 can be described by making reference to the number of records 2 
v! ■ ■ Iff ^>t*^^ Large databases may contain millions of records, 
/ • '{■ ^ ^ ; 'r i The task of an Internet search engine is to provide the user with a list of links to Web 

; . 5^^^ the search engine calculates are likely to hold infonnation desirable to the user. 
. 5 This list is compounded by using a search term or query 3. One method of compounding this 
, / • A "full-text" search algorithm identifies records tiiat contain key 

: • >^SW every irecord; In otiher words, the search process effectively identifies 

\\ ■ such as record 2 that contain the search term 3, When the search is completed, a 

' > V, count of the total number of records containing the search tenn(s) is compiled and 

"^^^^ ^ *^st of links to those records to, allow the user to view the records. 
^^^^'^^^^^'^^ the number of matches, e.g., *%000 matches," liiiks and descriptions of the first few 
; ' jm^^ displayed to the user. The user reviews the number of matches and the 

' , • i ^^^^^^^ descriptions of some of the matched records and either decides to try a different 

.s^rch m ail attempt to shrink the number of matches or selects one listed link to access a 



" \>^^^i^'$f^^^^ record. 



' - ; problem with these types of search engines is the often-large number of matches 

jf^fl f t^^^^t^^^ the user. If a user enters . die search, term "tires " he/she may receive over 1 million 
: 1 million records looking for the best or 

. • « r;;^^^^^^^^^^^^ record that he/she needs, 

^wi^^^ If the user edits the search term(s), he/she may pare the number of matches down from 
J;:^ ? to 200,000i butthis number.of matches is still too large for a user to view and use to 

: • : ;; make an effective decision. The user may then try to re-edit the search terms in an iterative 
prpcess until the number of matches is manageable. However, this iterative process of re- 
•^^i ^^i^^^'^^* consumingand may frustrate the user before he/she receives the 



^fcT/USfll/10185 

In an effort to reduce this frustration, search engines were developed that categorize 
•. ;^^;?'^'>'dsandprovidethecategoriestoiheusers^ 
■ . records before executing a searchusing search temi(s). 

. . . , Fig^e2.showssomerecords205.210and215 fromdatabasel. -meserecordsare 
;. ^®^*'°P^'y*'^*®Sorics250showaare''Virginia.»'Tairfax;'''McLean," 
■ ;-|:ii^°'^'''^'*"Chantin^^^^ Thesecategories 250 relate to state, county, and city. • 

I ff^t^^iS^r^"*"^^^ For example, if a 

; r ;ijf*dc0ntains data which relates to a certain geographic area such as a state, then that record 
: • tag identifying its ife^ Other records that do not 

■ili^^°^'^^'**^****^^8^8^Wcaica.areijottagged These tags 

; ■ J . ; ;'are.;later usedto identify andretrieve records, containing data related to certain geographic 

; : ar^. As a furttier exainple. ifa record contains the word "Virginia." then that record is 

. .. . , a tag called "VA." 

? i'^iSjil ■■ '^^^^^S*^^^*^"^^^ 205, 210 and 215.aretagged with a single toconomy because 
. .Jl5.:-,;,,:^an^^^^^^ categories250 represent.a class or subset of the taxonomy "Location." Assuming 
. records within database 1 are categorized, database 1 can be referred to as a "single- 

-I^Si^;' ■^'''^^^ definitions, it is clear that ataxonomy is a hierarchical organizatbn of 
^;^|;j^p|ories^and thevarious to^ 

, organize the records- in a database. TTiis organization of the records, in turn, makes it easier to 
' search for^ retrieve, and display records containing specific data. In other woids, a user may 

i .■;;yf'irl^:$^^ 1 if flic records in database 1 are 

■.•..'l^roijerly. tagged. 

, V;: -^;. , Typically, taxonomies and categories are selected from among those characteristics 
. ^5. H^i^attribbtes which a user would intuitively think of to launch a search. For instance, a user 



■ -i:^ . ■ ■ 



) ; ^attempting to find a physician in McLean, Virginia, using a Web search engine would 
, . : 'forrnulate a search based on certain intuitive characteristics, one being the "location" of all of 

: -the physicians m database 1 . This intuitive characteristic becomes a taxonomy. This search 
' : . ./pan-be narrowed by using attributes; such as "state » "county" and "city." Hicse intuitive 
. * 5 . / ttributes ate categories within (he taxonomy. 

.■/^'':; ^®P"'''^*^^'*'"<'5*'^'^v«nt'<>nal search tools based on categories is tiiat they on!^ 
, , Pfoyide aiemer with a single taxonomy. For example, assume that a user searc^^ 
• , . . ; |te?c,o'nomy called "i;x)cation" and a category called "Vii^inia" to identify all of the pharmacists 
;i • / i-' Ift^^*^"" ' Suppose now. however, the user wishes to identify only those pharmacists who 
v,: ' J^i;^^^^^^^ a single-taxonomy, categorized search this means launching a 

; ' ;." search because"retair' is neither an attribute nor a characteristic related to "location." 
K. "retai^^^^^ of location and is related to a different taxonomy, such as 

j:,;,;^.M^^Erpductsand Services." 

Dr '^^'^ to alleviate this problem, many single-taxonomy, categorized search engines 
is'-' •■ .albw Boolean operations. Thus, if the user discovers that there are 10,000 pharmacists in 
; ; , v i:,k;!^!{Sinja. he/she may further refine this search by searching for the word "retail." Thus, the 
' t 'l¥>l''fef'^**^ ^"^^ to be "Pharmacists" AND "Health Insurance and Information" in ttie 

. ■ ' pltegory "Virginia." Hiis type of search modification is only marginally effective, for several 
reasons. First, the use of a^Boolean search at this point usudly entails the initiation of a ne^ 
i ■ the search engine, because it does not provide a taxonomy, cannot suggest 

¥ -^(^^^^^^^^^^^^ the sMith to. the desired data, which requires the user to be clear about 
'■y^^K^^i^^ the Boolean query terms in advance. Third, such a search engine is inefficient 
. , , ,t>^ause it requires an exponenrial increase in the number of operations to produce a set of 
• ' . hits. 



Another problem with Finding information in product catalog databases is that the user 
. J^^i^opexi. asked to choose multiple parameter attributes that end up defining a product that 
';'^%^:^dQ^t exist. For example, a user may. be interested in finding a used automobile satisfying 



•i; \ i^tii^^^ criteria: greater than 200 horsepower, less than 10,000 miles, greater than 50 

: ? : ?|i|^|es)jper gillbn fiiel efficiency, and a price less than $ 1 0.000. After spending time naming 
'•vvlifMllliiese'pax^ reveal that no product contains all these attributes. An 

: | :i (stiteniative embodiment in the present invention is to have the user first specify the one or two 
that are most important and then present the user only with valid, non-zero 
;:=tf||^||:a^e^^ regarding products in the catalog. For example, in a "step search" process, the user 
: \ : -;;lo^sf^|^^ consider the attribute of in excess of 200 horsepower as the most important. The 
;;|Ss^^^ would then inform the uscar how many cars there are that contain this attribute and 
to view these results from a variety of perspectives, like by price (e,g. 10 
■i V;;ISl^fen $lO>OO0-$2OiOOO, 50 between $20,000-30,000 and 100 in excess of $30,000); by fuel 



' • (e,g.^80 between 10-20 mpg^ 60 between 20-25 mpg and 20 in excess of 25 mpg); 

''''0!M^y0>y- mileage (e.g. 50 between 0-20,000 miles, 50 between 20,000-50,000 miles and 60 ia 
' ^i;|-:^^essof50;000•TO^^ 

toan-attempt to address data searching of ever increasing databases, many techmques 
I' cj^/have been^developed. For example, U.S. Pat No. 5,675,786 relates to accessing data held in 
' ' ■ ; ''ifM lai^e computer databases by sampling the initial result of a query of the database. Sampling 
iv? '02^^^^^ result is achieved by setting a sampling rate which corresponds to the intended 

r ' ;?^i>ratl0^at which^ records of the initial result are to be sampled. The sampling result is 

■ • than ttie initial query result and is thus easier to analyze statistically. 

: • = i^^f^^^iSifc this of data sent as a result of the query to the end user, it 

j^^^ in an initial search of what could be a massive database. Further, dependent upon 



^Ithe^sampling rate, sampling may result in a reduction in the accuracy of the infonnation sent 

I ■-•■»':.:•.•■,•)■:, 

t6 the end user and may thus not provide the intended result 

Another example, U.S. Pat. No. 5,642,602, relates to a method and system for 
{ ' 1 ^s&Lrching and retrieving documents in a database. A first search and retrieval result is 
5 It Jjompiled on the basis of a queiy. Each word in both the query and the search result are given 
^ ; ^a^weighted value, and then cotiibined to produce a similarity value for each document. Each 
^.fdocument i s^ ranked according to the similarity value and the end user chooses documents 



Af^Mfrcih the ranking. On the'basis of the documents chosen ftom the ranking, the original query 



' . 'V'^' ^s^updated% a second' search aiid a second group of documents is produced. The second 
to ' \ group of documents is supposed to have the more relevant documents of the query closer to 

^ the top of the list While more relevant documents may be found as a result of the second 
I ^ search, the patent does not address Ae-pioblems associated with the searching of a large 
!f^^'^f^^^^^^ fact, might only compound them. Additionally, the patent does not return 

. r ^^categorized: se^ch results complete witih counts of the number of records associated with 
iswv^ttiose categories. 

r • • ■ 

w ^ Yet another example, U.S. Pat. No, 5,265,244 relates to a method and apparatus for 
^V,^^ ^fdi&a'acctss^usiiifg'a particular data structure. The structure has a plurality of data nodes, each 
for storing data, and a plurality of access nodes, each for pointing to another access node or a 
, r V ' data node ^Information, of a Statistical nature, is associated with a subset of the access nodes 
'20 ' and data nodes in which the statistical information is stored. Thus statistical information can 
\ [ \ I ?i^i5;retrievcd tisiiig statistical queries which isolate the subset of the access nodes and data 
; |l;^j^|riodes whichiconmin the stotistical information. While the patent may save time in terms of 
p %yaccess to- the statistical information, user access to the actual data records requires further 
.procedures. 
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Further, U.S. Patent No. 5,930,474 discloses a search engine configured to search 
^ i^geographically and topicaUy, wherein the search engine is configurable to search for user- 

entered topics within a hierarchically specified geographic area. This system makes use of a 
' . /^static index of results for each taxonomy, not a dynamic search which precludes the abilit>' to 
5 ^ ,switch among multiple taxonomies. The system is also not text searchable at any time during 
^ ^ /^a* drill-down, or. taxonomy switch. The system also doesn't include counts of records with 

^categoiy results. . . 
ij « y g Patent No. 6.0 12,055 discloses a search system comprising multiple navigators 
switchable.by tabs in the GUI, having the ability to cross-refer«ice amongst said navigators. 
1 0 This is just a method for accessing different information sources, not a method for text- 
< ^ V^^searching. Further, it do,es not offer user-categorized search results with counts. 



^ U.S. Patent No. 5,682,525 discloses an online directory, having the capability to 



, * display an advertisement incorporated within a map display, wherein the said map has indicia 

for points of interests selected by a user from a drop down menu. This inveiition describes a 
! 5^il» targeted advertising based on categories selected within a 

» * ■ 

/^(^MfiierarchiGal taxonomy. This invention does not consider cross-sections of categories across 
tnultiple taxonomies, i.e. location, business type, and products/servicjes; Nor does this 
invention consider the addition of keyword searches as a further limiting item for identifying 
^^Jtargeted advertising, . . 
'2Q%^ff ' U .S, Patent No. 6,078,9 1 6 discloses a search engine which displays an advertising 

banner having a keyword associated therewith, wherein the keyword is related to a user- 

'''''' f'^^'^^^B' 

>i ^^entered .search topic. This invention discloses' a method for organizing information based on 
. V. ij|te'e statistics and heuristical information derived from a user's behavior. 

tv 

;\j ' Megaspider, a meta-search engine, has a web directory with hierarchically arranged 
. 25 , geographic regions, having subcategories therein for topics, said directory being searchable 



iyithin a geographic area or within a topic. However, MegaSpider's search technology 
^ ^ employs a static hierarchical drill-down and cannot execute a fiiU-text search and return 
/^&ategorized search r with counts. Additionally, this system only has one hierarchical 
^^^^^takonomy and cannot switch between multiple taxonomies, nor yield categorized search 



5) ^results with counts when searching. 

/\ ' ' U.S. PatentNo, 5,832,497 discloses a system which enables users to search for jobs 
v'/ lif y g^^^g^'^^cal location and specialty. While this invention does discuss an iterative method 
"''! ^«or finding information in a multi-dimensional database, it does not consider categorized 



.search results mth counts (i.e. the ability to conduct a field or free-text search and have the 

10; , results be returned by one or many sets of hierarchically organized categories with counts of 
//^f ^theiiiumber of records associated with each of those categories), nor the ability to switch 

' /^f«among taxonomies. 
<Y 46 However, none of these conventional systems provide users with a multiple- 

. " 4^|.;ltaxonomy, multiple category search engine that allows users to search for records, where the 

v^'.^jjr* , ■ . . . • 

' 15 «^J;..^user is allowed to toggle among the multiple taxonomies as an aid to locating desired records 

V ' '/ \ • 

'wi thou t constraints. 



' '•'l^'w? Traditional search engines ate also not generally compatible widi small screens such 



\ '^^ as' on cell phones, pagers and personal digital assistants (PDAs) and pahn-held devices. This 
' s :^ {vis^ecause these traditional search engines deliver long laundry lists of record hits that the 



'^20'^ vuser is required to scroll through. Transmitting these long laundry lists requires substantial 

. ^Bandwidth. Generally, an increase in use of bandwidth by a user translates into an increase in 

^^^^^ W'^^i^^Y . . - ■ 

fteost Additionally, these small screens only allow the display of one or two record hits. This 

, Timakes it cumbersome for the user to compare the record hits to determine which one best 

'I : ^P'^^M^' '^^^^^^ requirements. The present invention, in contrast, provides a mechanism for 
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;.;;;toggling among taxonomies so as to narrow die display such that it may fit onto a small 

\'''*'"'' '^^^Iscreen. 

,; Additionally, ^traditional search engines do not provide ways to effectively relate 

>: ;;^5|||a|iner advertising to the user viewing the search results. As an example^ suppose a user 

the search term "Virginia" AND 'Tharmacists " The search engine may place a banner 
' ^^if the results Web page to a pharmacy in Virginia that is hundreds of miles away from the 
: This ad placement is not valuable to the user or the merchant. Thus, there is also a need 
N^i^^^ a user is searching for in a more specific manner so that banner advertising 

Vmaybe provided to that user where the advertising is more closely related to what the user is 

• ■ • SUMMARY OF THE INVENTION 

The present mvention overcomes the shortcomings identified above. More 

the present invention is a multi-taxonomy, multi-categoiy search tool that allows 



to **navigate" through.a database using any of the taxonomies at any time. 
In addition, the present invention overcomes the identified shortcomings of other 
'ssearch engines when small screen devices are employed to display search results. More 



. ;,S ,, ^^.^;specifically, the present invention transmits and displays categories for users 



J^^^ users with long .laundry lists of record hits. 



to select from 



S \ Through the presentation of categorized search results, the present invention allows an 
J' enormous database to be. represented in a very small footprint, which is ideal for wireless 

■ ; • ? ;'l^||;^eyices.. • 

" V : % - ' fiirther, the present invention provides a mecham'sm for "slicing-and-dicing'* the 
' in a database, thus, allowing the creation of personalized or customized data 

25:^!^fjtcljp!^^^^ of information. 



' V ' ' ThiB present invention provides such advantages by means ofa system for search 
^collection of data, said system comprising; an organizer configured to receive search requests, 
^a(iel?6rganizer comprising: a collection of data having at least two entries; wherein the 
/ I f i^cdiiection of data is organized into at least two taxonomies; wherein each of the at least t^o 
5 . -taxonomies is associated with at least two categories; wherein the entries correspond to at 
ieast one of die at least two taxonomies and also conespond to at least one of the at least two 
. /categories; aiid a search engiile in communication witti the collection of data, wherein said 
^^ Y^,r>^s|e^ engine is configured to search based on the at least two taxonomies and based on thie at 

, it>|east two Categories, wherein the search engine returns, in response to a search request 
: I'o! O^iientifying at least a first taxonomy of the at least two taxonomies, a list of the categories 
' . t'^iassociated with the at least first identified taxonomies* along with the number of entries 
'Li, .^"associated with each of the categories associated wifli the at least first identified taxonomies. 
r ^ M ^ The above advantages are further provided through the present invention, which is a 
' ' V'\system< iFot searching a collection of data, said system comprising: means for networking a 
15 ' ^plurality of computers; and means for organizing executing in said computer network and 
> . configured to receive search requests fi:om any one of said plurality of compiuters, said means 
^4^ comprising:- a collection of data halving at least two entries; wherein the 

kH i^^^^ is organized into at least two taxonomies; wherein each of the at least two 

' / taxonomies is associated with at least two categories; wherein the entries correspond to at 

20 ' least otib of the at least two taxonomies and also correspond to at least one of the at least two 

> * it 

.J'^^ ^^^^ate and means for searching in communication with the collection of data, wherein 
f^^said. means for searching is configured to search based on the at least two taxonomies and 
. 1 ' ' ^based on the at least two categories, wherein the means for searching returns^ in response to a 
search request identifying one of the at least two taxonomies, a list of the categories 



associated with the identified taxonomies, along with the number of entries associated with 
,V !v^acK of the categories issbciated with the identified taxonomies. 

^P4^4)-: .... . 

! The above-identified advantages are fiirther provided through a system for searching a 
' eollection of diata, said system comprising: means for networking a plurality of computers; 
5 and means for organizing executing in said computer network and configured to receive 
/ i^search requests from any one of said plurality of computers, said means for organizing 
\X ^ f^comprising: a collection of data having at least Uvo entries; wherein the collection of data is 
, ^ ^organized into at least two taxonomies; wherein each of the at least two taxonomies is 



^'I^ associated with at least two categoric; wherein flie entries correspond to at least one of the at 

. 1...... . i . 

;]0.i%eas 



h 6 >>^^least'two taxonomies and also correspond to at least one of the at least two categories; and 



^ '^r^ means for searching in communication with the collection of data, wherein said means for 
^ ^searching is configured to search based on the at least two taxonomies and based on the at 
i least two categories, wherein the means for searching returns, in response to a search request 
, i ^ .s identifying one of the at least two taxonomies, a list of the categories associated with flae 
'^15 '4 ^ identified taxonomies, along with the number of entries associated with each of the categories 
; » .associated wifli the identified taxonomies. 

Additionally, the above-identified advantages are provided through an article of 
ifV X'tnanufacture coinprisirig: a computer usable medium having computer program code means 



r -,J$';\embodied^there6ci for seairching a collection of data, the computer readable program code 
\20 '\^rmeans in said article of manufacture comprising: computer readable program code means for 
' */;V&^(^mmumcating a search request to a search engine, die search engine being in conununication 
with a collection of data; wherein the collection of data has at least two entries; wherein the 
collection, of data is organized into at least two taxonomies; wherein each of the at least two 
. , , , taxonomies is associated witfi at least two categories; wherein the at least two entries 
25|tji correspond to at least one of the at least two taxonomies and also correspond to at least one of 

' ' • ' 12 



. >'|diis„querying of ttie collection of data. 

„;4u«y are gteeted with aff '-W view of Ihe entae data coueono 
;i.^itea^^d — sctvi«on.he,.t™etW.hapin8i...no.hen«d.,p^^^^^ 

>:';'*^3,r«io.o,tK.u=e,/Ini,,..«»cs.h«edatac«n.«ooi^^^^^^^ 

- . « ^,.t„r^ B electronic ptoduct catalog, product type, 

^ nVmore than one independent category structure (e.g.. electronic p 

\';1^™a.ionno™d.«et»tpc,sp<cUv..aUho«*«o„e»bodi™e..of*.p.e^^ 
... . , .,.ep.e«ch"t.xo,»mi«a.notta™d«ccd.n«..h.us.rha.drineddown.oaspcc^c 

::#L<^.«.a.-.t..»^"™es 
■'>••;;^„„tl.,heu,«ba,...ec«d■«lt«hingcat=go,,in.hc.p»^ 

■ %,;,,oal«:goiymtl«'Ti<lduct Type" taxonomy. 

' • s,eps««h«le.p«to%.pPly<osemep..ductsiptheelec.,omcca.alog, 

• .1, oc ''Price " "Promotions" and "Brands", apply to all 
^ while traditionaltaxonomies, such as Pnce, rromouuub 

' 1 A "KA^«;*or ^i7fi" taxonomy is obviously inapplicable to a 

^'-^i|f'->.^ ^ MomtorSize taxonomy 
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; f(^er;.se^ for clofliing products as much as a "Style" taxonomy is inapplicable to a user 
; y for a computer. A "Price" taxonomy, however, would apply to a user searching for 

' ' ' I: Users thus have the. ability to intuitively navigate through huge amounts of 

5 iinfoTination by using keywords and categories in conjunction with the different taxonomies of 
; r collection. These navigation features are a significant aspect of this data collection 

^: S^s^ixih that differentiates it from conventional search technology. 

When a user knows what he/she is looking for, the invention quickly uncovers the 
:■ V ;^pght infom without forcing the user to^ go through numerous irrelevant search results, 
:;^|^^|iejiei^ search technology comes when users do not know or are only vaguely 

what they want. In these instances, where a user needs to browse through all or 
? . 5:i;i|pairt;?P listings, keyword searches with categorized search results (from different 

; will facilitate easy navigation by providing the user with context and scope 

y. V to the search results and by giving a user the information he/she needs to find the 

>:^S^m^^^3:^c\B, services and information they requh:ed. 
- i The present invention provides users with an aerial view of the data collection at all 

I^ll'l^^rr^ a search. Users remain aware of where they stand in their search and how many 

iS^i,- ^-^ - ^ • 

|||j:^^^||)hls p^^ satisfy their query. More importantly, users receive categorized search 

^'^ffM^^M^ that provide summary information on the records in the data collection that remain 
: 2^;||||ij^^ parameters of a search. 

U ^i^Jf Users of (he present invention can look for information using keywords they feel will 
d'^^^^^J^fi^ search. The system will locate every record in the data collection that 

..... r'^VVU..(,.^-. . .■ _ . 

i^ebntains that particular word or phrase and instantly return all the data categories (at the 
:::-^, ,^i^^^^^cw6l of the search as then being conducted) that have associated records. The search 
^iS^^estilt^ indicate how many records exist within each applicable category, and allow users to 
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r l^l^isily hone down on the specific segment of the data coUection he/she is interested in and, 
i • importantly, to disregard all other inelevant information. 

~ . For example, if a user enters the search teim.«wheel alignment," the system would 

„ : ' . ' s?aTch all the records in the data collection that contained the term "wheel alignment" 
; ^lll^ther.th^n returning a long list of 1,701 search results that satisfy the user's query, the 
1- : . J' .^^sent inveiition provides the user with the categories that are associated with the remaining 
K : - \ fiords aiid indicates how many records are associated with each category. This functionality 
' 'assists the user to further refine his/her search and disregard the irrelevant informatioa 
' ; , . , These searched data collections provide users with summary information (categorized 
results) about the data collection being searched. Users need not use pulMown menus 
;,.J,.pff n »nany "required" fields to construct the parameters of their search (zip code, city, 
.'•Ji^businesscategoty^ Rather, search results display the valid categories and indicate how 
:, • \ . ■ , 4m records are associated with each applicable category. Users are thus presented with the 
• ® ^P***^"^ *® collection (through a dynamic aisle and shelf stnictuic) andcm 
hierarchically organized data collection information or switch among 
. 0-^,>,tjij^oriomies to find what they require. 

' ; • • /. If a user within the Healthcare Providers Category clicks on "Physician," the present 
:P(fn'ionproceais down the hierarchy and presents the user with the next level categories 
''JM^^^-^^ physicians by area of specialization. 

>|fr ^ instances where data collection information can be associated with more than one 
^f/yi^ii^aeRendent category structure (e.g., product type, color, size, brand, price, promotions), users 
V invention can switch among taxonomies of the electronic product catalog at any 
■;. , . >time during the search process and look at information from different perspecUves. although 

., ,tl.ii pne.embodiment of the present invention "step search" taxonomies are not introduced until 
25 theiuspr has drilled down to a specific category in the "Product Type" taxonomy. For 
•■' 15 



.y/!^^j^le» the "Style." "Color," and "Size" taxonomies are "step search" taxonomies because 



J . not presented as options to the user until the user has selected a clothing category in 

X ' Ji^^^^roduct Type" taxonoiiiy: Likewise, taxonomies for "Processor Speed," •'Hard Disk • 
■ • III!' 'Woi^or Size^"^and'"Memoiy Amount" are not presented as options to the user unUl 
: is- ' j^^^^ has' selected a computer category in the "Product Type" taxonomy. 

Step search taxonomies preferably apply to some products in the electronic catalog. 



; traditional taxonomies, such as "Price," '^Promotions" and "Brands", apply to all 



t' I^S^tS in the electronic catalog. A "Monitor Size" taxonomy is obviously inapplicable to a 
7 ;, : ;;|^p|ftsearching for clothing products as much as a "Style" taxonomy is inapplicable to a user 
% SSSlfeung for a computer. A "Price" taxonomy, however, would apply to a user searching for 

I- Viii^Si- ■ If a user clicteoff the'"Price"- tab, thepresent invention will instantly reorganize all 
!i ;5i$i^|electronic records that remain within the parameters of the search (regardless of number) 
''i:! ':'\iii|pi|prMeM (he same-information categorized by a "Price" taxonomy of the electronic product 
0 is-^ ISktalpg. Switching among taxonomies is possible at any point in the search process. Further, 
'}.[, ;4 ;,:;';'Iicertaiti taxonomies are designated as "step search" taxonomies are presented to the user as 
;|)^^fc|fbrred'bptions Wheri^tK&userhas^dri^ category mthe "Product Type" 



' ^ ttSl^- The data collections repHcate existing business paradigms IVom the physical world on 
• •■sovil^ib^thelnternet landscap aisle and shelf structure and humanistic interface can 

S ^^S^ftlab companies retain current users, acquire new customers, and maximize tiie value of their 
;v #5 This'furi6tionality also spawns new and innovative revenue and business 

. ; . ■ : '0:'' hod^is that help monetize eyeballs and turn Internet browsers into buyers, 
'■■S I^is understood that the Internet provides an unprecedented opportunity to collect and 

i i;|??^^pTMialyije data. The present invention also improves the collection of user data because users 

"^^;*iyjj|;||^ ' ' • 16 " 
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■' data;CoIlecUQn information by drilling down hierarchically organized 

, :; ..'P|a,tegories;Using their mouse or wireless, keypad. Each time the user clicks down a category 
^iror switches his/her taxonomy to a-dififerent category structure, there is the opportunity to 
■ accumulate real-time marketing information that can be responded to interactively or later 
5 collected, analyzed and used to derive revenues. Cumulatively, this additional information 
} - "^*^<9n»ers (demographics, decision patterns, trends, preferences) is more meaningful 

' ' • '*?nd;Can kelp manage customer relations and product development. 



V ^ V \ 
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.i' ^invention; 



BRIEF DESCRIPTION OF THE DRAWINGS 
I?igure I. is a simplified diagram of a database; 
Figure 2 is a simplified view of various records; 

Figure 3. is a system in accordance with a preferred embodiment of the present 



vfl ^ ' ' figures 4r8 are screen shots a user would see when using an embodiment of the 
'^ii^P'^'sent inMentipn as applied to a yellow page directory; 

1 , Figure 9 is a representation of how a query interacts with indices and how those 

^ records in-a database according to an embodiment of the present invention- 
, ^'Sures 10-12 represent process steps a user would go through to drill down to a set of 

). , -f repordsin a database, in^ accordance with an embodiment of the present invention; 

Figuiie. 13 is.a system in accordance witha preferred embodiment of the present 
//.fnyeptipri-;.,- ; . 

Figure 14 shows a searching process in accordance with an embodiment of the present 
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,^tin\y.ention; 



' Figure ISisascreenshotofacategorizerinaccordancewithanembodimentofthe 

jj^^||^||sen^ 

' ■ 17 . 
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: ■ '', "Jjl ? Figure 16 is a representation of categories arid reads in accordance wiHi an 
; ^^^bodimcnt of the present invention; 

• Figure 17 illustrates a method of distributing, indexing and retrieving data m a 
■dMuted data retrieval system, according to an embodiment of the present invention; 
■:5?lM:; ■■: Figure 1 8 illustrates the distribution of data information and the formation of sub- 
;;|^ii&tipns in a distributed data retrieval system, according to an embodiment of the present 
■ 'V • Ijnyention; 

Figure 19 illustrates an inverted index from which a sub-collection view can be 
■ ;; l^pErated in a distributed data retrieval system, according to an embodiment of the present 

'iiiP^^raiieniiGin; 

: ^^r'^^^:^^ Fi^re 20 illustrates a sub-collection view, according to an embodiment of the present 
. / " Jiriyention; 

Figure 21 illustrates the paths of communication forming a network between a central 
-j: ^Ciputer^d a series of local computers m a distributed data retrieval system, according to 
• 15|||an embodiment of the present invention; and 
• ^^l/'. Figure 22 illustrates a global view, according to an embodiment of the prcsoit 

'•ly^^wirivention. 

■4$m> i)Et AILED DRSCRIPnON Q P Ttf P. INVENTldN 

'^^^^'iRithiSt decode. Typically, such an on-line computer service provides access to a 

: ) ^hietarehically structured database where information within the database is accessible at a 
: i:'' MflpW^^ of computer servers which are in communication via conventional telephone lines or 
: ''r^lp. litjks, and a network backbone. For example, the Internet is a giant internetwork created 
' ' :§J^<0Tiginally by linking various research and defense networks (such as NSFnet, MILnet, and 



G)Bi-line computer services, such as the rntemet, have grown immensely in popularity 



<!.^REN). Since the origin of the Intetnet, various other private and public networks have 
• ^^^ome attached to the Internet 

The structure of the Internet is a network backbone with networks branching off of the 
' ^ibackbone. These branches, in turn, have networks branching off of them, and so on. Routers 
5 ^^raove information packets between network levels, and then from network to network, until 
S|be^packet reaches th^ of its destuiation. From the destination, the destination 



^^^^network's host directs the information packet to the appropriate terminal, or node. For a more 
, ;^^^etailed description of the structure and operation! of the Internet, please refer to "The Internet 
' •^ '^f^mplete Reference," by Harley Hahn and Rick Stout, published by McGraw-Hill, 1994. 
lol.. A user may access the Internet, for example, using a home personal computer (PC) 

^ ,,qquipped with a conventional modem. Special interface software is installed within the PC so 
^ ^ ithat when the user wishes to access the Internet, a modem within the user's PC is 
^'Vl'w to dial the telephone number associated with the local Internet host 

> " '^^lel^er The user can then access information at any address accessible over the Internet. One 
15 ^>i;well-known software interface, for example, is the Microsoft Internet Explorer (a species of 
. ^ ' .JfflnrP Browser), developed by Microsoft, 



[ \ Information exchanged over the Internet is often encoded in HyperText Mark-up 



'^Language (HTML) format HTML encoding is a kind of markup language which is used to 

/\ 

' define document content information and other sites on the Internet. As is well known in the 
20' ^ ' art, HTML is a set of conventions for marking portions of a document so that, when accessed 
' ^ parser, each portion.appears with a distinctive format. The HTML indicates, or "tags, " 
* . %fwhat portion of the document the text corresponds to (e.g., the title, header, body text, etc), 
' 'tJand the^parser actually, formats the document in the specified manner. An HTML document 
' ^4^sometimes. includes hyper-links which allow a user to move from document to document on 
isi^^^Mi^ Internet A hyper-link is an underlined or otherwise emphasized portion of text or 

:%4''' 19 - 



^ l^lgraphical Jmage which^ when clicked using a mouse, activates a software connection module 
^^which allows the users to jump between documents (i.e., within the same Internet site 

(address) or at other Internet sites). Hyper-links are well known in the art. 
V I ' One popular computer on-Hne service is the Web which constitutes a subnetwork of 
.^5^ J^j^pnj-lme documents.withinlhe Internet The Web iricludes graphics files in addition to t^t 
^ffilbs and other information which can be accessed using a network browser which serves as a 
.^graphical interface between the on-line Web documents and the user. One such popular 
/browseris the MOSAIC web browser (developed by the Nfational Supercomputer Agency 
i) 'W^,^^^))'-^ web browser is a software interface which serves as a text and/or graphics link 
lO*' t?i$5tween- the user^s. terminal and the Internet networked documents. Thus, a web browser 
' ^ ^allows the user to "visit' multiple web sites on the Internet. 

^ ' ' Typically, a web site is defined by an Internet address which has an associated honije 
^f ^&page. Generally, multiple subdirectories can be accessed from a home page. While in a given 
home page, a user is typically given access only to subdirectories within the home page site; 
;i;5 \ ^ho^wever, hyper^links allovv. a user to access other home pages, or subdirectories of other 
^home pages, while remaining linked to the current home page in which the user is browsing. 
' /' >r ' Although the Internet, together with other on-line computer services, has been used 
^ 'A twidely as a means of sharing information amongst a plurality of users, current Internet 



^brciwsers.and other interfaces have suffered from a number of shortcomings. For example, the 
. 20 ' i^brganization of information accessible through current Internet browsers and organizers such 
' vas.Micrqsoft Internet Explorer or MOSAIC, may not be suitable for a number of desirable 
^.^^^applications. In certain instance, a user may desire to access information predicated upon 



V A „ 'St , 

, ^^^^.categones as opposed to by subject matter or keyv^'ord searches. In addition, present Internet 
■ organizers do not effectively integrate the categorical information in a consistent manner. 
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In addition, given the large volume of information available over the Internet, current 
' j systems may not be flexible enough to provide for organizatton and displ^^ 
Vt^t^^f kinds of information available over the Internet in a mamier which is appropriate for the 
s ^ amount and kind of data to be displayed. 

5-^ ', Figure 3 is a system overview in accordance with a prefened embodiment of the 
. Pfesent invention. A plurality of user computers 3, 3a and 3b are coupled to a network 2. 
' ' I' llSli^^^'^'^ ^ alsVcoupled to another network 2a which itself is coupled to other computers 
4. shown). Computer 10 is also coupled to network 2. Coupled to computer 10 is database 
^^1 pE)atabase 1 contains^ a plurality of records (not shown). 

The network 2 maybe a private or public network, an intranet or Internet, or a wide or 
V ^ local area network which not only connects the user 3 but other users 3a, 3b and other 

, S^i^networksla tb^cbmputer 10: 
, \ , For ease df understanding, in the discussion which follows, the network 2 will 

r 'comprise the Internet, though this need notbe tfie case. 
*^1^5r It should be understood that electronic product catalog 1 comprises a multiple- 

I/*J^°°°^y'^3t^8'^"ze'i electron^^ product catalog. In such an electronic product catalog the 
* ^records have been tagged or otherwise categorized by more than one taxonomy. For example. 
' ^ m& .records in electronic product catalog 1 have been categorized by the taxonomies "Price," 
r^^J^e." "Brands" and "Promotion." In this example, the records have also been categorized 
f^><^y additioAal "step search" takonomies, but these taxonomies (such as "Color," "Style" and 
^ ^j'fSize" if the user has selected a clothing category, or "Monitor Size" and "Memory Amount" 
, if the user has selected a computer category) are not presented as options until the user has 
,«^l;led d6wn to a specific category in the "Product Type" taxonomy. 

In one embodiment of the invention, computer 10 receives search requests in the form 
-25' ,^;,9f data (hereafter referred to. as "search-related data") via network 2 from user computer 3 . 

21 
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^llpiearch-related dataxpmprise a search term entered by a user to initiate a keywpixi search, or a 
.-.•!^9i»omy.er«teg<)iy.-8electedfcy the.user by "clicking on" a portion of a screen. 

The category and/or taxonomy selected by the user and sent to computer 10 is a way 
for the user to navigate a Web site. As such, the category will be referred to as a 
^5 , ^"navigational category" and the faxonomy will be referred to as a "navigational taxonomy." 
' '■V^I • For examplci when the user accesses a web site, like web site 4000a or 4000b in 

' Figure 4, he/she is presented with an initial screen which displays taxonomies 400 1 and 4O02, 

, , .-namely 'Xocation" 400 1 and "Products &-Services" 4002. The user may then insert a search 

• I ■ ■ 

.?i,>term 3001^;and select a taxonomy 4002. After selecting a taxonomy, the user tiien selects a 

I r t T! . ... 

Once computer 10 receives the search-related data, the present invention utilizes the 
navigational taxonomy 4002 and category 502 in the user's search request to determine suTj- 
N >, categories from the hierarchy associated witti the navigational taxonomy and. category. 



r ' I For instance, if the category 502 comprises "Physician," then the process might yield 



Jr 



,l5^^^^&ub-categories 503 shown in Figure 4000b. One such sub-categoiy 503 is "Neurologists^* 
] Sub-categories 503^will be referred to as "navigational sub-categories." 

Qnce computer 10 :lias determined the sub-categories 503. it then can launch a search 

A .^^jfi;fdirecte#o4atab^^ ...... 

[^iv. ^^^^ appreciated that the present invention envisions computer 1 0 launching 
20 .search queries aimed at database 1 using sub-categories 503 which are not selected by the 
usCT. Rather, these sub-categories are dynamically selected by computer 10 based on the 
t^ taxonomies and/or categories input by the user. 



'ftV< Acpording to one embodiment of the present invention, a search query may be carried 



out in a number of ways. 



: "V ' " . ' For example, in one illustrative emboidiment of the present invention computer 10 

ilaunehes a search query comprising a search terra 3001 , a taxonomy 4002 and sub-categories 
• f^' ^-0^1:^^^^ *° database l. • Computer 10 compares the navigational taxonomy and sub- 
-,^<?ategpries 503 to the database taxonomies and sub-categories making up database 1 . If a 
5. ■ ^r^ptd is tagged with a database taxonomy and a sub-categoiy which matches a navigational 
; . ,yit^^conomy and sub-category, then that record must contain characters which are responsive to 
\ : • 4:^;*^^^^ After a match is detected, computer 10 compares the search term 3001 

... 1 .^^^^^ those records having matching taxonomies/categories. 

I matching records have been identified, computer 10 generates a numerical 
, vfPv^v. count of all of the records within database 1 which have a character string that matches the 
V i' ^ U^^^^^ This numerical count is further broken down by sub-category. For example, 

? ; iSa^liife shows "428,935 Listings Found" for the category "Physician" 502. Within this, *77" 
'\; ;>t^^|reMe^^ sub-category '^Neurologist" 504. 

. ^ In another embodiment of the invention* computer 10 launches a search query 

15 s compnsmg only a category or sub-category without a search term. This enables a user to 
0 ;. . Vfe;-'^^^^ through database 1 merely by selecting a narrower and narrower sub-category, 

• ^ ' ^ t !?^!^!^^ another embodiment of the invention, computer 1 0 is adapted to launch search queries 
' : i.^ IV / " ^"^y ^ search term or terms. It should be noted that computer 10 initiates any one 

%. ;'^^Qf^^^sp^typQS of search queries at any level of drill-down. 

'■}00^iM^ illustrative eiiibodiment of tiie present invention, a user may also drill-up 

1 , f;f of categones/sub-categories. For example, once a user has drilled down 

- ana:reaehed the level represented by screen 4000b in Fi^^ 

' . ' .' , 

; , category "Healthcare Providers" 505, and upon receiving this category as search-related data, 
V . . c<>,?^P^ter 1 0 returns to screen 4000 in Figure 4. fn addition to drilling-up. the user 3 may 
^ ;>^f r" ;: .; ?s\yitch taxonomies at any point in a drill-down or up. For example, the user can click on the 



'^opation" taxonomy 4001 in Figure 4 and be presented wift categories conresponding to this 
S.V^^^onpmy a^^ all previous search constraints are maintained. In all cases, when the user 
'%icks'on or otherwise selects a taxonomy, category or sub-category, computer 10 compares 
Mtfei search-related data to a hierarchy as previously explained. A search is then launched by 

.5, ;'^^Qmputer 10 usirig navigational sub-categories which result from this comparison. 

^V:'^tMp,ilv!;.^. , 

. Figures 5 and 6 provide display screens 500O and 6000 depicting other examples of 
■t ^ :jj|K^^^^^ a search using two or more taxonomies 5001 , 5002 can be displayed. 

- ->^ginning with Figure 5, there is shown an example of an initial screen 5000 which displays 
V I J=^^8?>rf?s 505 which mdce up a 'Troducts and Services" taxonomy 5002. Though only a few 
; ilp. 1^^^ are shown, it should be understood that categories 505 may comprise any type of 

or service/ or some subset. In the example shown in Figure 5, the user types in a 
J^^^^^^^^^^^^ The 
: ! V '^H^^^^present invention, however, is not limited to displaying the results of a search against only 
' ' : on one screen at the sanie time. Rather, the present invention can display the 

.''^'X, V .V.'/ - 

Vi^^iii:)^'Slte=^o against multiple taxonomies on one screen at the same time. 

:r^^:.{fi;'^^^^^ Computer 10 then selects imvigatioiial sub-categories 506 which correspond to 
' r X^^^:%^cat^on*' taxonomy and subsequently launches a search query against database 1 using 

search term 3002, taxonomy 500 1 and sub-categories 506, It should be noted that both 
: ; , iiK|ta^^ 5001 , 5002 are provided to enable a user to initiate a search using either 

Goritinuihg; Figure 6 depicts an example of a screen 6000 generated from the results 



%of 1 the just described search query. As shown, the screen 6000 displays categories 

. ,: -t^JSOfr which are'nm related to the "Location" taxonomy 5001. In 



' = ' ^f il^^^l^ti^^*^^^^^ *® number of records containing characters matching the search term •'neurology*' 
. 25 V ^ 3002 is also displayed. As before, this number is displayed as a total and is also broken d6\vn 
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WoTi^each subrcategory* Forr.example, next to. the sub-category "Virginia" is the number 
» ^pS5^,55l number of records within database 1 that contain data or 

' teharaeters representing neurologists withi 

' ' It should be understood that the user need not input an additional keyword to further 
5 ^^^^rfarrow his/her search; .Instead; computer 10 generates mtuitive sub-categories 506 which are 
^-gresented to the user.for the very purpose of narrowing his/her search. In addition, the 
. number of matching records for each sub-category is displayed without the need for the user 
i , jfib^individually launch separate searches aimed at each sub-category, 
I iii 1 It should be understood that the terms "category and "sub-category" are relative 
10 ^terms and in some instances may be used interchangeably. 

The ability to switch among taxonomies, to drill-down or up, or to switch among 
|H||taxonoime$ while drilling down or up enables the user to navigate a Web site and 
\ ^t^cliWesponding database I with great ease. This ease-of-navigation can be used to enable new 
revenue models. In . one embodiment of the invention, new revenue models, such as 

,15 advertising models, are enabled from such easy-to-navigate Web sites. 

H \ 

Taxonomies and categories/sub-categories can be analogized to aisles and shelves in a 
I ^ '^ tfgrpcery Store, A user fmds,the shelf ("category**) he/she is interested in somewhere in an 
^ ^ Vtaisle C'taxonomy^') comprised of multiple shelves. In brick-and-mortar grocery stores ( i.e., 
, ' ^"^^.?pllys^cal, not Internet stores), companies.have sought to catch the eye of a shopper as he/she 
20> i^^j^scans a shelf by placing advertisements next to their product Ideally, the shopper will notice 



v;|*^P andjbe enticed.to buy^the product over other similar items on the same shelf that have 



advertisement associated with them. The present invention envisions the enabling of new 
l^aii'yertising revenue.models based on the selection of aisles and shelves (jXy taxonomies and 
' categories). 
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' Figure 7 depicts an advertisement 7000 generated when a user selects the category 

^Health Insurance & Information" 7004 in the "Products and Services" taxonomy 7002. 
' '^Jilsing the aisle and shelf analogy again, the user first selects the *Troducts and Services" 

a^fsle, scans the aisle and determines that he/she is interested in those shelves associated with 
b Sicalth Insurance & Information," selects those shelves and is presented with a list of shelves 
which are related to "Health Insurance & Information." The user can then select the specific 

shelf or sub-categoiy 7003 which he/she is interested in. Unlike a physical grocery store, the 

. t ' ^^''-^ ...... 

aisle'' that the user has *'walkedV down is actually two aisles. All of the products on the shelf 
* f%a\;e been organized by ''Location" and by "Health Insurance & Informatiori," Thus, as the 
10 \user '^stands" in front of the shelf associated with 'Health Insurance & Information," he/she is 



^^lalso ''standing" in front of a shelf which is also associated with some subset of the "Health 



L I 



^ ^Insurance & Information" aisle. In the physical world, it is as if each end of an aisle has two 
"^jEsigns^ one labeled "Location" and another labeled ''Health Insurance & Information." Dovvn 
4the aisle are categories of items which are associated with a specific location or locations and 
15 ^^particular products and services, 

^ ^yj^f In one embodiment of the invention, computer 10 selects advertisement 7000, based 

j< i ... 

^ '^'^ ^on the taxonomies, categories and/or search terms input by a user, in this case, based on the 

user's selection of the category *'Health Insurance & Information" 7004. The selection of 

such an advertisement will be referred to as "attaching" an advertisement based on the search- 

?),20^^|^elated data input 

"^^^ Computer 10 attaches advertisement 7000 only when a user selects the category 

"Health Insurance & Information" 7004 for example. More generally, computer 10 attaches 

advertisements based on real-time, instantaneous actions (e.g., selection of a taxonomy or 

^ J category) received from the user. It should be understood that any type of advertisement may 

' 25§f(, \ht attached by computer 10 in response to search-related data supplied by the user. The 

26 
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s^rch-related data supplied by user begins as preferences in the mind of flie user. As Ae user 

(^^navigates through a Web site he/she makes choices based on those preferences. These 
>^choices are manifested, in the taxonomies, categories, sub-categories and search terms 
; selected or.otherwise input by the user. 
5 ^ Computer 10 also attaches an advertisement at any point during a drill-down or up, 

. l^wHen a user, switches taxonomies, and/or upon the input of a search term. 
' ^ Th^ ability to attach advertisements based on real-time preferences of a user is useftil. 

;In particular, this capability allows on-line publishers to use new models to generate revenue. 
• ^ Publishers will no longer need to rely on a circulation rate model. Instead of sellmg on-line 
110 y iadverUs^ments based solely on historical, circulation-related criteria, advertisers can establish 
, > ^ '^'^Vevenue models based on real-time user preferences. In one illustrative embodiment of the 
\ I invention, publishers can charge different dollar amounts by category level. For example, a 
" publisher may create a multi-tiered advertising rate structure. Such a model may comprise a 
' first or lower tier and subsequent higher tiers. In an illustrative embodiment of the invention, 
,^1^5 ' ' ^^tiie lower tier may comprise a relatively low dollar amount with each subsequent higher tier 



\ ' Comprising an ^increased dollar amount. In addition to linking each tier to a dollar amount, 
' '^^ computer 10 links each tier or tiers to a category level. For instance, the category **Health 
. ' Insurance & Information" 7004 may represent one category level while the "Location** 
,,^\tfp^ommyl002 tnay represent another. In an illustrative embodiment of the invention, 
'iCf computer 1 0 links each of the levels to a dollar amount So, one level may be linked to a low 
* ^ ^dollar amount while another level may be linked to a higher dollar amount. 

A publisher may generate revenue from such a model as follows. If a business wants 
' ( advertisement to be seen whenever a user is attempting to locate a pharmacy, a publisher 

rl^may charge a fee of $1 ,00. Each time a user selects the "Location" taxonomy 7002 the user 

^^^^ .. - ■ . 

25%} ^ would see an ad corresponding to this search level. If, however, a business only wants to 
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■ 1^^^ ^'^^^ a user needs a retail phaimacist, then the publisher may charge a higher 
I ■ ' ■ 52.00 to allow ad 7000 to bedisplayed when a user clicks on the category "health 

: & information" 7004. In one emhodiment of the invention, computer 10 attaches 
. ' j:;v , v!:#to categories located .farther down a hierarchy for a higher cost than ads closer to the 
5 - beginning of the hierarchy.; The rationale behind such an advertising model is that businesses 
.: ,, , '.:are:'wiH to pay higher advertising rates to reach those users who are engaged in focused 
' .Sff^' ^ ^"^'temative embodiment, higher rates are applied at higher categories because 
view these categories than- individual sub-categories. As can be imagined, any 
^'M ^^^^ ofimoddsvcan bccreated. These include, but are not limited to. the following-, a 



comiputer 10 attaches ads to categories located farther down a hierarchy for a 
. ■. ■ higher cost than categories.at the beginning of the hierarchy; or a model where computer 1 0 
. * premium cost to categories within a hierarchy. In these models, the 

'M^^^^^"^^ deterrainedbythebreadth or "direction" of the search, Le^ drilHngiip or 
''S"/^;^^^'^' J^anothermodel, the-advertisingrate is based on the popularity of the 
' 1;5 Z / ?^te©)ry*or on tiie uniqueness of the category. 

. Figure Sdepicts screen 8001 generated in accordance with an alternative embodiment 

v^^^P?^^^"*'"'^®"*'®"' embodiment, computer 10 generates advertisements 8001 

. ■ ':;|;!|fffiW^^^ initiates.a search which- includes a search term which matches a term used 



.iwitltin^^ 



For purposes of explaining Figure 8, it is assumed that the user has drilled down using 



clicking on the 



• ':iS35^3f ^^^^ Upon 
; is displayed. The ad 8001 does not comprise a 

' •; '^bg^er'-^ advertisement, such as ad 7000 in Figure 7. Instead, it is a "display advertisement 
■ ; ; / fqrsa particular business, in thiscase a hospital. In an illustrative embodiment of the 
computer 10 attaches an advertisement when the search initiated by the user 
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■ '^^"tainsacWter.stringwluchmatchesacharacter.^^^^ ^Figure 

iU'fc'*^'**''''*''''"^"'^^^^^'^'^^^^^^ also 
; ;^ltheseatcK^tenhv3G02 frora,Fi^^ This is a fomof syndicating an advertisement from a 
; ; merchant to a user. The present invention allows the merchant to build his/her advertisement 

' ' ^ . . . . ■ 

' .5, - in any format and have itdistrfbuted-TTTUs, the present invention acts as a collector and 
^ syndicatorof data. 



i' >^ tii r . "ser preferences are manif(»ted in the taxonomies, categories and search . 

. ' ^ ; tennsselectedorolherwiiseinputtedinlDaWebsite. As illustrated above, these stored 
' h ' |i)references can be used to focus a search by selecting intuitive, navigational subcategories 
'"^2,^°"' ^ "^'^"'^^ of categories/subcategories. These preferences also trigger the display of 
, *° P^fe'«"<=«s or at least to the perceived preferences of such 



' ' .S:', i These real-timepteferences can be used in other ways envisioned by the present 

^ Po'-^x^Pk, the present invenUon envisions computer 10 tracing user 
>!> ■ 'P'^^®'®"*'^- '^'^ ^^'''"g >s done in near real-time and allows a business to follow a user as 
r''; J V^b/she-works her way through a website using taxonomies and ahierarchy of categories. In 
; ^^^^^^'''^^^^'^^'^^''^■''^■^ tioraputer 10 stores the taxonomies and categories 

^selected by a user to determine, for example, the products and services preferred by the user. 

' ^^M''*'" ^ ^ ^"""^ '° ^^'""^ '^^^''^ ^''"'^"'y within the data collection 

2^Sla'erafch^%eit-a'ds-sli6dldbe attached. 

Figure 9^^rovides a scherhatic of the data as it is stored and organized in a database in 
-VM f;apcordance withapreferred embodiment of the present invention. The database 905 contains .. 
-.inany r^rds, :905aj 90^,-and 905c. In this example, a record is a single unit of identifiable 



'WP^' include individual Web pages, text documents, collections of 
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video, still image, audio data, or any combination of these. It should be noted that there are 
; /pother types of data that may be' grouped together to form a record. 

\ Three exemplary: records are shown in Figure 9, Record 905a is a plain text 
/docjiment Contained within this record is a word such as ''tires " A record such as this could 
5 ^e an HTML page (or XML document or database record) attached to a service station's main 

' V ^^^^ 

' '^liome page. Once a user has accessed the home page, he/she would click on a link to access 
y ' -jthis text document to learn what services this station provides. 

Record 9G5b is a home Web page used to advertise a tire store and Record 905c is a 
home Web page used to advertise a physician's clinic. As shown. Record 905c includes text 
^ descHptibn of the services provided by the clinic and a graphics interface fomat 
' " ti^^^ file that is a map providing details on how to get to the clinic. 

w . Indices/databases 910;9l5a and 915b are used to access records in database 905^ 

I, ' 

' ' s ^tf^^^^^ index '902 contains^a listing>of all the key words and phrases 9 1 0 in all of the records 

^ '^'■^ ^' . ■ ■' 

' ^ , ;;^|in;(latabase:^905, and other indices 915a.and 915b.. Examples of such key words and phrases 
'^"^Ff li^de 'tires," "batteries/' ?-safety inspection,^ "allergies," "broken bones" and "family 

^' if , , . . 

, ' ^medicine." Attached to each ofthese key words and phrases are links 910b. These links 
/ ^/refCTence^each record in mdex/database 905 tiiat contains these words and phrases. 

todices/diatabases 9l5a and 915 b represent different taxonomies of database 905. As 
",,i.!fe;sho'wn by thes headingSi index/database 915a is a "Product/ Service" taxonomy of database 905 
i';'20s',., Wdlindex/databaise 915b is a "Location" taxonomy of datobase 905. 

These three indices/databases 910, 915a and 915b are used to access the records in 

' ' 

'^database 905 in three different ways. Index/database 910 receives search tenns or phrases 
i'j^and is scanned- to locate those key word or phrases. When a hit is discoyeted, the number of 
;*$»P^ 9l0bilhat reference^o database 905 is flien determined. 
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■^^l: / Indices/databases 915a and 91 Sbprovide data collection lists of their respective 
'■'Xi^S^^&nis in response to user input As an example, if the user clicks on the 



♦ '-'^ ?^J^^ucts/Services" taxonomy, all of the categories within that taxonomy are displayed. Two 
.^ptHpse catiegories include "Physicians and ''Automotive." As shown in Figure 9, each of 
5 ;^,^e categories is'd^ 
! • Aiiifti^yice;* !*Allergists,'^'"Caidiologists" and "Radiologists." 

:i: IiMJex/database 915b is a taxonomy of database 905 based on ''Location." Within 

'■ ■ ■ ■ ' • -^fe.-' . • 

$15b are categories. An easy example is a listing of states or countries. Each state 

? : ^ itiSl!^ By having multtple taxonorriies of the single database, multiple paths are possible to 
■ v '^^^ ^^^^ records. Figure 10 shows one set of queries from a user and the system 

. ;^?i^onses that represent a path a user may take to reach the records he/she desires. The user 
|v a search term againist the "Products and Services" taxonomy. In the 

' -"■ ■ ■ ■ ,■ .'' v'(l||;l 

j:?^tennines that 36,653 records in the database have the word "tire" within them. 

The present invention then determines the categories that are associated with the 



, the ^iearch tferfn is *'tirei" The present invention queries term index 910 and 



Iearch tenh '*tire". For exiample, almost all of th6 records that have the search term "tire" in 
^feare^^tegoirized intb th^ group of *^\utomotive " The user selects the "Automotive'* sub- 
iltegory and the preise^^ then searches through index 915a to determine how many 

idsMeicbrds within each of the sub-categories also are associated with the search term "tire." As 

' "fiiiw only 254 records organized into the '\Automobile Dealers" category 

;:ilv|if^S|i:' ■ ...-..* 

' V "tire" while 13,887 records organized into the **Automobile Parts & 

T " ; '^^^ category contain the keyword "tire," Thus the present invention compounds all of 

v ^ J^tHis data iand provides it to the user. It should be noted that by pushing data back to the user, 




case a glimpse of the organization of the categories, the user can learn how best to 
■ / ;^prpceed with drilling down into the data. 

"^^^^^f responds to the list of subs:ategoriespiwided by the present invent 
•■ ■• y^'^^yecting one. In this example, the user selects the sub-category "Automobile Parts & 
5- Supplies". 

; . The system responds by providing a list of all 13,887 listings that are associated with 
: . "tire/; This list is unruly for a human being to wade through so the user 

5 V Y W^"^'^ "Location" taxonomy in response. 

; i& system responds by cross-matching the 13,887 records against the categories 

/' "l!^!^^*^*'^ Thus, the system generates a directory of these 13,887 

■ ;-|?^f|S^fds as;organized by state (Le., Virginia has 303, etc.). 

■■• 'yt^fi'ii^fiv': ■■■■ ■■■ 

■ ; ■••^J'SS-' '^W'i^.to these sub-categories by selecting a particular state, say Virginia. 

■ ISS^ ^sP^"<is by cross-matching the sub-categories within Virginia. In this example, 
: ■ ■ \7||'s^|^sSub-categories are the various counties and city municipalities within Virginia. Once the 

'i^ ^^v/^^^^'™**^^"^^? completed, the system provides the user with a list of appropriate sub- 
■ '■ l^p*^^"^ records match the search so far. 

I- 'Sl^fel? responds by selecting the sub-category "Service." The system responds by 

iding a list ofallofthe records that match thesearch. The userrefines the search via the 

■ ::;«t^J||^eatiorf' taxonomy, Thus, the user selects the "Location" taxonomy and the system 

by cross-matching the records associated with the sub-categoiy "Service" with the 
■f ; taxonomy (Le,, cities or counties m Virginia). The system then 

I: .■^^Pf 1^ the Usting of categories with the number of records associated with the sub-category 
• , : ; Service" and each city or county in Virginia. 
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' systen^T^sponds by listing the sub-categories under the category "Virginia" 

: ; ,0^, "Alexandria." "Fairfax County," "Arlington County, » etc.) with the number of records 

; - ^associated with "Service"'in parentheses. 
, ■ ■ The user selects a listed sub-category. Following the above example, the user selects 
5; ^^.■tAIexandria." Tlxe system responds by listing aUoflhe "Service" associated records that are 
, *alsp associated with "Alexandria" in 'Virginia." 

: |! The user responds by entering the search tenn "tires." The system receives this query, 
; ... -■..i^^^^^ records associated with the search term "tires" from free-text term index against the 
Jeims stored therein and cross-matches those records associated with the search term "tires" 
:^^^0^1h^]kt^Tccoris. TOs produces a list of 15 records that match the search. In this 
*^ "^'^ '^'^^ '"^^^'^ t^ono'ny "Location;" the category "Virginia;" the 
^.■•|4x^^^^ and Services;" the category "Automotive;" the subcategory "Service;". 

•„; • - tlie^onomy 'location;" the category -Virginia;" the sub-category "Alexandria" and the 

"^^'^ ^"^^ exaihples demonstrate the versatility of the present invention. First, the 
J'\«^user.is not required to go through a specific path to reach the desired number of records. 
.While the above examples show only three paths to reach the desired set of records, itcan be 
' , . ' • ajipreciated that there are multiple paths to reaching the same set of records 

™' P^"^"^ achieved by the independence of the taxonomies shown in 

2^;^fi<fegure 9. By keeping these taxonomies independent, the usermay switch between which 

Vjtoxpriomy'he/she wishes to use to consider the data and make queries into electronic product 
, • : ^Jeatalog 905. The level of the seareh that the user uses to make a decision to switch among 
iri^^'^ also arbitrary and up to the user, with the exception of any "step search" 
■■ ' ''teohomiesthathaveiiotyetbeenpresentedasoptionsatthatstageofthesearch. This 
,25 . allows users who are more proficient in developing searches to use their proficiency in one 
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\ ^'^taxonomy index to whittle the number of electronic records down before going into another 
' ' ^taxonomy index to finish the search where the user is less proficient, and vice versa, 
j \ Another feature of the present invention is the« pushing of data to the user. As noted 
^ ^h6ve, the user receives (Mtegoty and sub-category information when a query via a search 
5 'Verm is used earlier in the process; As noted above, suppose the user is looking for •*rims" for 
ihis/her car, instead of tires. By typing the search temi "rims " the system will provide the 
* 'category list to the user so that he/she can drill down into the data. Thus, if there were a sub- 
'^^ub-category of **tires" the user would eventually see that sub-sub-category and make the 
* ,4'association between "fires" ahd '-rims." Thus the user comes in contact with a useful 

10 /iCategory or subcategory that he/she can use to search for desired information. 

1/ ' . . . 

' i\'> The present invention is also useful as a new method of domgbusmess. More 

-^1 4 '^'^^isi)jec the present invention may be used to advertise items in the database for 

. 1 ,vmerchants or manufacturers. In this business model, a plurality of merchants submits records 

, , , f diat advertise their stores, goods and services. Such a record could simply be a copy of a 

45 ^ l^^Web page that includes the merchant's line of business, address, phone number, a map 

showmg the location of the store, hours of operation and a picture of the storefront. It should 

;^ f ' be^ noted that this example is not limited to physical stores, but may also be implemented 

^ f %^sing virtual stores. Additionally the character string search permits a user to receive 



^ N./^dnfonnati&n directly from a merchant or manufacturer. 



Mil 



20 These records are categorized so tfiat associations are made between the categories 

^ and sub-categories in the nlultiple taxonomies and the records. In addition, terms within the 

iij'fe .records that correspond to terms in the free text terra index are determined. Associations are 

f I'j'^then maide between these records and. the various categories and terms in the indices. 

These records act as searchable storefronts for the merchants. Since the records or 

25 , storefronts are categorized, a consumer may use the organization of the categories to locate 
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■ ' ^ example, assume a consumer was trying to locate a phaimacist to 

/ ii^piiJfa prescription. The consumer would select the. "Products and Services" taxonomy. The 



'■..••t*S«"«f.S:'-'' > . 



.'^^ "^spends by providing the list of categories and numbers of records associated to each 

' : ' One ofthese categories is *Healthcare*'.which the consumer then ^ The 

' ^;5^ ^^|s!^^^ displaying all of the sub-categories of "Healthcare" such as "Allergists/^ 

''■'^^'^^^''>:^' ^^^^^^^^^^ selects the sub-category "Pharmacists/* This sub-categoiy is the end of 
• ' - "'^^ example. Therefore, the system displays a hit list of all records that 

. i; vil^M'^^^^ "Phannacists." If the database is large, there could be thousands of 

-■^'^■^^^^^ sub-category. To put a number on it, this exemplary database has 24,346 

C -i :i^^smM^ki'>^-': The consumer wilb then want to limit the number of hits by viewing the records 




l%ith the sub-category "Pharmacists." He/she does this by drilling across to the 

': ^ )^?||^|^^^^^^" taxonomy, which instantly reorganizes all 24,346 records into geographic 

^ \; i selecting the category "Virginia" and the sub-category "Fairfax County" the 

;rn5|^^^pi§^'^ the- records to just those pharmacists in Fairfax County, Virginia. 

■'' consumer- has used the records or virtual storefronts to peruse the vast number of 

■ : :f t|pS^ offerings to find the merchant or merchants who can best suit his/her needs. This is 

V ^m/s^I^^I^^^S^^"^ ^ ^ consumer in that he/she does not need to drive around the neighborhood 

I; : f ji^te storefronts to leam what each business is selling. In addition, 

■ ^rf w^ be pushed to users based on a given search criteria as previously 

l^^*^^*^^^ description of Figure 8. 

' ' '^m^il-: ™^ system also has advantages to the merchants. Suppose a merchant does not want 

' f ! '^f^'^^^*" the costs of maintaining-a Web site. Maintaining a Web site also requires that the 

• -i^W^-^ search engines can locate his Web site and allow the 
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' "consumers to access it. In other words, a Web site that cannot be located will not lead many 
tjonsumers to the store. 

In this embodiment; a merchant or user may spend a small fee to submit the virtual 
' , ^^orcfront/rifcbrd and avoid the costs of maintaining a Web site. In addition, by virtue of the 
' 5; \ '.se^chability of the text of the record/virtual storefront, the merchant is assured that the 
tecSord/virtual storefront is locatable. 

Another advantage of fte present invention is the way results are provided to tiie user. 
/^As.noted in the many examples above, much of the sifting through the database is done via 
0$hl categories and sub-categories, hia preferred embodiment, there are many more records in 

\0 ^ '>he database thafa there are categories. As an example, a search term may be associated wit^ 

' " t 

\ ^i|hbusands of records, but only one category. Providirjg a list of thousands of records requires 

' -<«<|; • ■ 

''f^^i^j'ot of data handling in both the transmission of the data to the user, as well as the displaying 
' f^cif'tihe data tc> the user. Providing a list of only one category is much less data to transmit and 
Y^i'splay thfe makes the invention ideal for use with devices with small screens, such as cell 

15 "^{phones, imagers, and personal digital assistants (PDAs) and palm-held devices. 



Figuie 1 6 is representation of a portion of the data stored in structure 902 and how 
' that dat^i is organized in accordance with a prefenred embodiment of the present mvention. 
- *WNode 1605 represents the category "Virginia" from the "Location" taxonomy Node 1610 

'f iftrepresenti the sub-category "Arlington." Node 1615 represents the sub-category 'Tairfex." 

'J^m\ ■ ■ . .. ^ 

: 20ii r>Node 1620 represents the sub-category "Service" from the "Products and Services" 
taxonomy. Record 1625 represents a single record. 

1 , ' Linking the nodes aind records are category code words. Category information is 
' ''S^' fstored% 'the inverted index as an encoded category codeword. Leading into node 1605 is a 
category code word called ''VA." Leading into node 1 61 0 is a category code word called 
25 "AR," Leading into node 1615 is category code word "FX." Leading into Record 1625 are 



^%m) 0im728 IP ^'/USOl/l 0 185 



' '^^^^ ^ representation shows how the various ciategories relate to each other 
''^"d the records. 

In one embodiment of the present invention, these path names are stored in inverted 
^ index 902 and used to retrieve electronic records. This structure provides several advantages. 

•J " V f * . • 

5 ..Jlh^one embodiment of the present invention, these path names are stored in inverted index 

'^m ' ■ ■ . • 

I ^\^?021 and used to retrieve electronic records. This structure provides a means to pertonn 
. ' /^Boolean operations on the path names to calculate category count results and to identify 
' , i'records that are identified by those category paths. 
^ > It will be appreciated that large global collections of data can be broken down into 

10 , V ^4mller siih-coUections. The sub-collections can be stored independently one from the other, 
' ^ ' ias|in separate phj^sical locations or simply in separate data tables within the same physical 
' ^Vocation, and can be coniiected one to the other through a network or stored locally. As data 
^are added to the large global collection overall, it can be sent and added to individual sub- 
^ icollectidhs and/or can be formed into a further sub-collection. For instance, data entered by 
15 ; ./^'educational institutions and scientific research facilities can be stored independently in their 
o'^ /yJirpwn data storage facilities and connected to one another via a network, such as the Internet. 

TKus, as can be seen, the present inveiition can be implemented with very little or no change 
' v^^^ r^^an the present protocol for data collection and storage. 

\l ;vf'%v^ If will be appreciated that the present invention provides a search interface that can 
20^ iiaggregate disparate databases and make the disparate databases searchable through one 
/ . ; mterface. 

Once the individual sub-collections have been identified, each performs its own 
^indexing fuuctiori. In carrying out the indexing function, each sub-collection creates its own 

> % 

sub-collection taxonomy consisting of statistical information generated from what is 
>as \ commGnly referred to as an inverted index. An inverted index is an index by individual words 
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^listing electronic records which contain each inidividual word. The indexing function itself 

'"^can be carried out in any method. For example, indexing can be performed by assigning a 

weight to each word contained in a document From the weights assigned to the words in each 
■• • • ' 

'^^documeh^ a sub-collectibh view (i,e., the statistical information derived from the inverted 

V ^I^^^W*®^)^^ created upon completion of the indexing functioa Regardless of how the sub- 

v^collection indexing is carried out, each sub-collection will have its own independent sub- 

.^l^^llection view based upon that sub-collection's inverted index. When data information is 

\ ^ i^^^^^ sub^cdllection, the indexing function is carried out again and the sub-collection's 

^ ? ^^yiew can;*e re-compiled from a new inverted index. 

^ ^ ir . . . 

AOi- ^ ^ Upon cdmpletibn of each sub-collection view, certain statistical information about the 
sub-collection view is gathered by a global collection manager to form a global collection of 
i ipparameters, statistics, or information. The global collection manager may either request from 
, sub^ollection that it send its sub-collection view, and/or each of the sub-collections may 

§spontaneously send the sub-collection view to the global collection manager upon 
I5f '^4'^ompletion. Regardless of whether ±e taxonomies are requested or spontaneously sent, upon 

^collection at the global collection manager of all of the sub-collectiotfs views, the global 

) ] ^ 

. ^^%<>llectiGn manager buildVa "global view" on the basis of the sub-collection views. 

' ^^Necessarily, the global view is likely to be different from each of the individual sub-collection 
' ^ ^views. Once the global view has been compiled, it is sent back to each of the sub-collections. 
20^ [n this manner then, a distributed data retrieval system is buih and is ready for search 

.{.and retrieval' operations. To search for a particular piece of data information, a system user 



H«^^simply enters a search query. The search query is passed to each individual sub-collection and 



'^used by each individual sub-collection to perform a search function. In performing the search 
sub-collection uses the global view to determine search results. In this manner 
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VSpn, search resdts across each of the sub<ollections wiU be based upon the same sra^^ 

Mji ' ' • • •• " • 

• ■ v&v';:.- ■ " ' ' ' ' 

■ ' : The results of the search function are passed by each individual sub-collection to the 

, ; . nianageir,,or:the computer which initiated the search, and merged into a final 

X^fVill^ search result Thp,final global search result can then be presented to the system user as 

; • ; . :v:("^ ;^a^'c^ search of all data infomation references. 

. r: . ■ 

, ;r: ; r The labeling of these paths also reduces computation time for other searches. For 

: 

)• X ;^^|^^ search (i^ h store X within 5 miles of apartment Y?), 

.{ ' V ; v /ithel^^^ invention can be used to make this detemiination. For example, if in one path to 
1,0 ; i ' tlie.record associated with store X is the path name *'SC' for South Carolina and in the 
;;;; ; ; vipbrrespondi path tO; the. record apartment Y is the path name "MD" for Maryland, the 
; that.the answer to this query is No by merely referring 

"'ii . Itshould be .noted that other variations are possible wit^ 
: ■ o;t(^iJ^iv^if^>^^^^ departing from the scope of the invention. For example, tfie number of 



to 



. . ' ^:,--j:';;fcharacters used to describe.a category is not limited to two and may in fact be any number of 
' • ' ;^'-V^%racters, Additionally, the category code words need not be limited to letters but may 
;; . vcncpmp^ss^ numbers, syriibols or a combination of letters, numbers and symbols. In addition, 
'A- -V'^^Hifi0^^ code words between the base node and each record are determined, they 

V ' 4 withmthe records as tags in a preferred embodiment of the present invention. 

.•^^ K';. \ Figure 13 shows a, system overview in accordance with an embodiment of the present 

i, ■ r . . -v' computer 505 is the central point It receives queries from and provides 

\ • . .; , ''^4 '•• • 

' V'. ; ^/^^^^^^ users. Hub computer 505 is comprised of front end 505a, back end 505b, 

■' ?^iP^op^;<>-^^^ .505c and cache memory 505d, Front end 505a is used to receive queries from 

25. i^users and format the results so that they are in a compatible format for the user to understand. 
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' ^^^aackend505b uses the appropriateprotowls to issue broadc^^ 
. messages. Coupled to hub computer 505 are spoke computers 510a, 5 10b p through 501n. 

- : ilte'^*^'"^"*"'^^*""'^^^ that are used to Store indices. 

. SS?^'*^^^^^ eachspokecomputer 5l0a-5l0n is laige memory storage 515a-515n usedto store 

••; ■',s;^-'5|^;)#:records in database- 905. 

emboditnent of the piesent iavention, hub computer 505 and spoke 

i TTie communications between the hub 

■i -^/S '^lll^fP"^^^^ Spoke 

■ ^Id^S^^"^'^ ^^^^"^ "^'"^ * standard database language, such as SQL, Hub 
V . - t:.iilil^iS|&ipputer 505 uses Visual Basic and C-h- to process data. 

; . >5; ■ Sli' : ^'^'^ ' ^ ^^""S'^ 22 show a method and an apparatus for the efficient and effective 
.■ 5 l^M^^"^"^ storage^ inde>dng and retrieval of data infomiation in a distributed data retrieval 
' : •• ^ of data- may be searched and retrieved fester by 

^ data.separate:indexing of thatvjistributed data, and creation of a global 

X: on the basis of the separate indexes. A method and apparatus for accomplishing 

' ^i'l^-^"^'"^^^^^^ management will thus be shown below. 

; ;,. -r^'-'^p. Referring-to Figures 17. and 18, in;st^ 100 of Figure 17 data information is 
J . ■S;^j¥^'^t^«««i'*^l^ intos^^ The process of distributing 

■ > ' ■ -gtfevdata>may be accomplished by sending the data from a central computer terminus 1 10 to 

. . palnodes 120. 130 and 140 of a computer network 10. or by directly entering the data at the 
I • ""^^ ^20. 130 and 140. Further^ the data may be divided such that the divided data is of 

ri : '-j^^^'or unequal sizes, and so that each division of the data has a relational basis within that 

"ch division havingan informational subject relation all its own). Such 
'■^^^^-'M!^'''^^^ ^""^ ^'''^ ^""^ distribution allow for little or no change to current data entiy 

■ • In the case of the Web. data entry can continue as it does now 
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V . MeU entity (i.e.. Universities. Medical Research Facilities, Government Agencies, etc.) can 
- V - illLtinue to.enter data as it sees fit. Thus, the sub-collections 1 50 can be organized in any 
/'J^i- jfishion .and be of an^ 

; I :In step 200. of Figure 17, the daU information, which has been divided and stored into 

■ M ;,die sub^Ilections 150. is indexed and a "sub-collection view" is formed. Indexing of the 
] , '^b*olleption.l5(H4a«^^*c step. of distributing the data, can follow current protocols and may 
I fk% computer-assisted,ormanually aceorapUshed. tt is to be understood, of course, that the 
^'^^'^.^y^^ is notto be limited to a.particular indexing technique or type of technique. 

■ ;.Forinstance.thedatamaybesubjectedt6aprocessof"tokemzation^m^ 
9i|' ■ ' records containing the data are broken down into their constituent words. The resulting 
l^^lcoflection of words.of each document is then subject to "stop-word removal", the removal of 
all function words such as "the", "of and "an", as they are deemed useless for document • 
'Sl&f'::^^. Theremaining words are then subject to the process of "stemming". T^iat is. various 
y i forms of a word are. condensed, or stemmed, to their root form (also called a 

] ,;1#^v."stem"). For.example, all of die words "running", "run", "runner", "runs", .... etc.. are 
,:; ■ , : stemmed to their base form run. Once all of the words in the document have been stemmed, 
' ■ ' each word can be assigned a numeric importance, or "weight". If a word occurs many times in 
,! ■ ' ' the documerit. it-is given a high importance. But if a document is long, all of its words get low 
'■^0^ >■ importance. The culmination of the above steps of indexing convert a document into a list of 
■ 'gi;! ■ weighted words or stems. These lists pf weighted words or stems are thus m the foim. 

documentsub.l,.fwdarw.word,sub.l, weight.sub.l ; word.sub.2, weight.sub.2 ; . . . ; 

' word.sub.n, weight.sub.n. 

Regardless of the indexing technique used, the.index thus far created is then inverted 
• ' .and stored as.an "inverted index", as shown in Figure 19. Inversion of the index requires 
- f?^,: .pulling each word or stem out of each of the documents of the index and creating an index 
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' ibasied on tbe frequency of appeaiance of the words or stems in those documents, A weight is 
\then assigned to each document on the basis of this frequency. Thus, the inverted index, has 

".,* ; /'ffie forrn of: 

'^^,^^mt&l word.sub,i .;fwdarw.docuinentsub.a, weight;sub.a ; docuinent.sub,b, weight,sub.b ; . . . 

..,^'5v^^f;^bcumen^^ 

■ '■ 

" . The inverted index 210 itself, as shown in Figure 1 8, is composed of many inverted 

! . ^ ^\yord indexes 220,' 230 and 240. and can thus be created and organized. As shown, each 

y^tM^C^ -. - ■ • ■■ ■ • 

f^^^^^^'^"^-^^^ ^^^^^ 220, 230 and 240 composes an index of a different word, taken from the 

\ document is weighted in accordance with tlie 

IIO K^^eguency of appearance of the word in that document. Completion of the inverted index 210 

y.;, . allows the derivation of statistical information relating to each word and thus the creation of a 

; J^subrcollection V 410, as , shown in Figure 19. The statistical information which makes up 

410 includes the total number of documents in the sub-collection 150 



f , ;||and', Telating to each word, the number of documents in the sub«collection that contain that 
;H?|fl§1^9rd- As each computer is indexing its sub-collection separately, the total indexing time for 
''D^^^^^^^^^^ the entire, collection is greatly reduced as it^is now shared across many computers. It 



that any method of indexing may be used to form flie sub- 
/ J' , , /IcpHection View 410 and that the above described method is but one of many for 
/■)[; accomplishing that goal. 

. 20 . In step 300 in Figure 17, once the sub-collection view 410 is created, a global view is 
k^'^v^^^'^^'^^^ distributed. For fomiation of the global view, each sub-collection view 410 which 
: ' ^p^hasiibeen created is.coUect^^ from the local nodes 120, 130 and 140 of the computer network 
,{|d^0 and sent to the central computer 110. Referring to Figure 21, showing an embodiment of 
fil^f P^^^^ communication of a computer network 20, sub-collection views from computers 
^"^ ^® sent to central computer 310 along communication paths 4.1 . Collection 



J' 
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'^^^ sending; of the sub^coUection view can be initiated by either the central computer 3 10 or 
'^^^^f'local computers 320» 330 and 340, If collection of the sub-<ollection views 410 is initiated 
, , |i^Sjf*the central computer 3 10, it may be initiated by individual commands sent to each 

\^mputer in the network 20, or as.a group command sent to all of die computers in the 
S^l^etwork 20. If the collection of the sub-collection views 410 is initiated by the local computer 
' «^p,»330 dr 340,^then the4ocal computer may send the sub-collection view upon occurrence 
of the' sub-collection view, anupdate of the sub-collection view, or some otlier 



. ; 'illl'f.^^^^ '^^^^ as' a-spccific time period having elapsed, etc. It is to be understood, of course, 

. /.tl^at any metfiod by which the completed sub-collection views are sent to the central computer 
1 0 ^^yftom the" local computers, is acce^ 

Upon collection of all of the sub-collection views 410, a global view 510 is created as 



1 ftoown itf^igure 22 Jn thefo^^ of the global view 5 10, the central computer 310 uses 
'i |the,sub^cdlleGtions 410 that have been sent from every local computer 320, 330 and 340 to 

jMetermine^how .maiiy electronic records are contained in die sub-collection residing at the 
\ 15f Mffi^^ and for every word, how many electronic records in the sub- 

' ' i%oirecti6n.contain die word in question. The global view 5 1 0 then comprises information 

; ,^|i^aining to how many electronic records there are in all of the sub-collections (i.e,, the total 
' , document sum) and for every word, how many electronic records in all of the sub-collections 
' I ' i^llf v "^^^^ question The global view, tiien, provides all of the necessary information 

/20;^f<>''* useiin weighting^^t^ words in a user query, as will be explained below. It is to be 
course, that any method which provides the central computer with the 
, ^^iiifqm to form the global view may be used. For instance, the sub-collection 

' ' ' ^ '^p'^^^ need not be sent in their entirety themselves, but instead the nodes could send only 
'-^f'^ilatisticd informati 
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f ' : To complete step 300. of Figure 17, the global view 5 10 is sent from the central 
.-i. Computer 310 to each of the local computers 320, 330 and 340 by way of communication 
tpaths 4.2 (as shown in Figure 21), Thus each local node in the network will now have the 
i^lbbal view. It is to be understood, of course, that the description of the formation of the siib- 
• ' ' 5 (%e6llection views and subsequent formation of the global view can be conducted on any 
' , .^xlieomputer network, and thus computer networks 10 and 20 are to be considered 



^ :;tpirit^rchangeable in this description. 



"z:^' ' In step 400 of Figure 1 7, the search phase is conducted. The search phase refers to 
i .search and retrieval of data information stored in the large data text corpora. Thus, to begin 
^1 or t ^with; in the seareh^phase a search query is entered and uploaded by a system user into the 
^feJi ^^il^ciiimput network 10. It is to be understood, of course, that the system user may enter the 
' <" ; ^^search query at any computer location that is connected to the computer network 10. Upon 

of the search query, the .search query is transmitted by the computer network 10 to all of 
.y;:jlf»e|fce local computers 120, 130 and 140 in the computer network 10. 

• After receiving.the search query, each local computer 120, 130 and 140 then mdexes 



'X. 



- , ' ; /.the search query using the same steps that are used to index the documents, namely, for 



,y>l^^ '*stop wordjemoval" and "stemming" and "weighting". The resulting 

(actually:stems).in the query are assigned importance weights using the global view 
' • ) yf\^[Q\^ each lbcal computer 120, 130 and 140 received in step 300. If a query word is used 

20; ' in many documents, then it is presumed to be common and is assigned a low importance 
. . V weight. However, if a handful of documents use a query word, it is considered uncommon 
)^%^^^sind is assigned a high importance weight The "total number of documents in the collection" 
' "^'il' 'iahd .the "number of documents that use the given word" statistics are only available to local 
' ' ' compuTters 120, 130 and 140 after the global view creation. 

:,■ Iff' . • 



It is to be noted, of course, that other formulae might be used as desired. If so, the sub- 
^ ^ i , collection view may be adjusted to account for the different foimula. It should also be noted 
\ ' ^|lti^t having each local computer perform an indexing of the search query might be necessary 
. ' >iifetiie entry point of the seiarch query is at a point which docs not have access to the global 
5 view and thus cannot perform the indexing function. However, if the entry point for the 
; j'search query does have access to the global view, then the search query can be indexed at the 



i'^mpyy point and distributed in an indexed format 



The indexing of die search query, as shown above, yields a weighted vector for the 
..f I /search query of the form: * 

10 , < . . query.fwdarw.word.sub, 1 , weight.sub.l ; word.sub.2, weight.sub.2 ; . . . ; word-sub.n, 

*.\^eightsub.n. 

^ *\ , 

. . . . 

/ ' ) - Having indexed the search query, a simple formula is used to assign a numenc score 

V ^ . ito^eveiy document retrieved in response to the search query. A formula, referred to as a 

. ' . ^Jl vvector inner-product similarity" formula can assign a weight to a word in the search query 



. \ I fyh and another weight to a word in the document being scored. Each document is then sent to the 

Iff ''.^^^ central computer 310^ via communication paths 4.1, from the local computer nodes 320, 330 
, • " ;%and.340;'--' 



In step 500 of Figure 17, once all search results have been returned to the central 
computer via communication paths 4.1, the central computer 3 10 merges the variously 



:20p^\ retrieved documents into a list by comparing the numeric scores for each of the documents. 
\.. J l^'V The scores can simply be compared one against the other and merged into a single Hst of 
' r^^^\^.xpt^^ each of the local computers 320, 330 and 340 used die same 

global view 5 10 for their search process. Upon completion of the merging of the documents, a 
^ complete list is presented to the system user. How many of the documents are returned to the 
' 2-5^puser can, of course, be pre-set according to user or system criteria. In this manner th^ only 
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the documents most likely to be useful, determined as a result of die s)«tem user's search 
• ' ' 'i^'qiiery entered, are presented to flic system user. 

^ ^ V ' It;should be noted that the maimer in which the global view 510 is created provides a 
' : fault tolerant method of distributing, indexing and retrieving of data information in the 
5 ' ,/distributed data r^eval system. That is, in the case where one or more of the sub-collection 
' tJI^'yiews isvunablfr to be collected, by the.central computer, for whatever reason, a search and 
' - *re^eval operation.can sUll be conducted by the user. Only a small portion of the entire 
collection 'is not searched and retrieved. This is because failure by one or more local 
; ,,.;-computers.results:inonlythelossofthesub..collectionsassod The 
data<te;tt corpora collection is still seaichableas it resides on diflferent computers 
. " J ^ , Further, to provide even more fault tolerance, data information may be duplicatively 
, ' Stored m more'than one sub-GoUection. Duplicative storage of the data information Will 
J'\7^jiprotect against not including that data information ina search and retrieval operation if one of 

|he^^sub^bllectiom m the dataanformation is stored is unable to participate in the 
.'^if'tlrsarch. and retrieval. 

Thus the foregoing embodiment of the method and apparatus show that efficient and 
effective,manag«nent ofidistributed infonnation can be accomplished. The current invention 
\, \'ff^pfi^Q division of the, large data text corpora into sub-collections which are then separately 
^5^.Miidexed, which indexes are then used to form a global view, is possible, as shown herein, 
20 without a loss and, in fact^ an increase in the effectiveness and efficiency of a search and 
, system. Furtherv>the search and retrieval operations take less time than current 

^isystemsiwhich either search the entire large collection all at once or which search individual 
' y^^collectibns. . \. . 



Thisfsystem implements;the search queries described above in the following manner. 

25 ^/tf'Firs^ hub computer 505 receives a query from the user. This query can be in the form of a 
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^search term, a taxonomy selection, a category selection, a sub-category selection, etc. Upon 
^reception of flie query, microprocessor 505c compares the query with data stored in cache 
'5p5d If the response tolhe query is already stored in cache 505d, the microprocessor 505c 



|f eturns that response as a -reisuU to the user. Hub computer 505 then waits for another query 

V . * ' * 

5 A;ftom the user. 

If the query is not in cache 505d, microprocessor generates a broadcast message to be 
sent to all spoke computers 5 1 Oa-5 1 On. This broadcast message includes the user's query. 



' , Upon reception, eiach spoke computer 5 1 Oa-5 lOn performs a search of the appropriate 
' ^ l^^ii^dex sttjred therein using the query from the user. In a preferred embodiment of the present 
'lO?^^,j|4in^^ 510a-510n stores all three indices 910, 915a and 915b in 

^ "i^llocal memory as described above. In addition to broadcasting a request across the network to 
' v^^iff^^ent machines, multiple threads could be used and the message could be broadcast to 

^ ||multiple ipfocessdrs in a single machine (on a bus rather than a network). Alternatively, the 
" r , ' search request could be conducted locally- a single process, single thread, single machine 
15 ""Search. 

' ^''^ik ' -^^^^^ *^P^®^^^^d embodiment, datastorage .515a^ 

^'^^Wf ^'^^ ^^^^rds in database 905. Since eachset of data is unique in data storage 515a-5 15n, it 
A^^^^follows diat the relationships between the indices stored in local memories 510al-51 Onl are 
. Valso unique because they cannot all access the same records. In an alternate embodiment, 
i204^spoke computers 5 l5a-515n all share identical copies of database 905, but the 

pdices/databases'910, 91 5a, and 915b are parsed among local memory 510a.510n. 

lEafch spoke computer 5 10a-510n returns the results, either a list or the counts for each 
" category, determined by its respective indices.to hub computer 505. Hub computer 505 
compiles those results andprovides them to the user. In an alternate embodiment, spoke 



•v«* 
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computers 5 1 5a-5 1 5ii are also provided with cache memories to reduce the number of queries 
Wmade to memories 5l5a-5l5tL. . 
; t In another prefeired onbodiment of the present invention, the system and method of 

. I'-lhespresent invention can be performed locally using a single process, single tihread, single 
,5 • ..machine system. 

,v „ figure 1 4 is a system in accordance with the present invention. At block B1405 tlie 
. I^tsystem receives a query ftom-the user. It should be noted that the query may be a term, a 
. ,;:;:::jdxonomyi. a category, a sub-category, a subHsub-category, free text, a field, a numeric range, 

(If B^opleanlpgic, combinations of elements, etc. At block B1410, the query is formulated with 
:abi(3i|tespect.to.tlw gui^^ of the present search. As an example, if the user enteis the 
;rjH^,ike)^^^ is foimulated such that the current taxonomy is taken into 

' ' > consideration (i.e., "Location"). 

, . . At block B141 5, the system deteimines the appropriate categories or sub-categories to 

'" ' V 

. , -search through to locate records that match. As an example, one possible category is 
'^^'^BWf^^m^" Frorn the.deteitniiuitions madein blocks B1410 and B1415, the system has 

;i^naTrowed the number of possible hits by discarding those records that do not conform to the 
;i- j^iplected category, It should be noted that, in a preferred embodiment, the categories or sub- 

,,f :teategories,are deteimined using an organizedlist such as aB-tree, another database or from 

■ ■ • 

•i'^/wheinverted index, itself. 

A^f^^'^^'t'BMZOi the.system checks its cache.. The cache typically stores three types of 
./f .^data; TJlie first type of data is a query result that was recently performed, thus if user A 
; :, "issues a query for term X in category Y, and 1 minute later user B makes the identical query, 
'tiie cache is used to provide the results, , instead of determining the results anew. The second 
;|^type of d4ta stored in the cache is frequently requested queries. Suppose users are, in the 

fr«l"e"tly requesting records on new cars but not requesting records on the disease 
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, ' . Imklariavi results frdm this frequently requested query are then stored in the cache. The 
'third type of data is searches diat are precompiled because otherwise they would take a long 
time to perform. 

If the query is not in the cache, then the query is broadcast to a plurality of processors 
, 5 idperating in parallel at blockB1425. It should be noted that blocks B1420 andB1425 are in 
. . , . J '^^^ because, they are not requirements of the process in order to be operational, but 

^'sl'/M^?^^ preferred embMiments thait enhance the performance of the process. To be more 
'^;/ r f jfspek^ifici i^ query is found in the cache, then blocks B1430-B 1440 are eHniinated and the 

^- f i^i?';^^^^^^^ to provide the user with results is reduced. The use of parallel processors 

, ^Ip' ! ^aPfl^^^^ on eifherportiohs of the queiy or searching only portions of the inverted index also 
" ! recluces the amount of time it takes to provide a result. Thus, a slower perfonning system that 
did not include a cache or parallel processors could also use the present process to generate 



/v.results;" • ♦ 
• ^\ - r ' \ At block- B 1430, the system receives the number of records that "hif * on the query 
, ),yi5 proyided in block Bl 405. At block B 1435, the hits are compiled and the number of hits per 

; • I v. . ..category, as determined in block B 141 5, is also compiled. 

: Q 4- At^ block B 1440, the results are displayed to the user. Typically, these results are 
.:^t; ! vk;j?i>^lSorga^ a preferred embodiment, the system will display a 

Ja-I^^^^^I^ of record hits when there are no sub-categories below the last category selected by 
W >the"user. This prevents giving the user a listing of categories \vith 0 record hits because this 
/' information is not as useful to the user as to know which category die record hits are located 

J. w^:;?^^- . .. ...... 

; V' 'At blbck B1445, a determination is made based upon the results displayed. If the user 
is satisfied with the results, the process ends at block B1450. If die user desires to refme the 
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V \ .; drill-up further into the database, the process continues with a new 

?>^^lfqudiyat^ttock'^ 

[y^S: V Figure 1 5 is a screen shot of a categorizer in accordance with an embodiment of the 
V^ llllpp^ent invehtio^^^ This embodiment of a categorizer is a graphic user interface (GUO that a 
: ;5 f V^^^^^^^ in associating records with categories. Typically, the system 

■ . ; ! ^, >/?Pf^^ uses this embodiment of the present invention to insert a new record into an existing 
ft '^SS'^tll^ fli& taflcon0niy.^Section 1505 is a toolbar that provides such fonctionality as 

Sif ^SR''i3^t^^g^ searching within a record, changing the viewed record, printing, etc. Section 15 1 0 is 
I JllB^^I^^ of the categories in the taxonomy. Section 1515 is a display of the 

, \ \ .-''I*e system bp^ator scrolls through the taxonomy in section 1510 and the record in 
'^^i liff^S^ looking fbt the b^t-fit categoric ft«- the record displayed in section 1515. 

; , v'^v^^l'l^^^ system operator believes he/she has found a best-fit category for the displayed 

instructs the system to make an association between the best-fit category and 



y 'iH^^^lf: ^IiV?a-'preferredembodiment of -the present ir 



c '^.^^'^.^y^^^r^^^i^^^^f^.- * r 1 invention, the record is scanned by the 

? system before it is ^displayed. This scanning procedure compares the key terms stored in 910 



word in the record When a match is made, the record is highlighted so that the 
■ ■ :V i^ksystem^operator-m key- terms are in that record. In addition, a count 



;;i t^^4 'iV on hou^ many key terms are in this record. The system then queries the various 

; ; ^ i;': Jljll^^li^^ category title that matches the key term with the most hits in 

>!'^^^^^^ 'Once that category is determined, that category is displayed along with its parent 

; . ; % ;|;Sateg6ries and its sub-categories so as to provide a frame of reference for the system operator. 
7 ' .'f r' V^^^^^ system operator agrees.wifli the autpmatically determined category, he/she clicks on 
•>f^^^^'^^'-1520:to;.create.an association between that determined category and the displayed 



•''.J>t^iSii.a:i,?yv 



^ system-operator does not agree with suggested category and cannot find another 



; / categoiy by searching through the list of categories, he/she clicks on button 1 525 to 

; A v?^%nsfruct the system to create a new category into the hierarchy. 



The present invention is. not limited to those embodiments described above. For 
5 ^>€xainple, the search tenns entered by the user need not only be textual. The present invention 
' : 'V'^pp^^ perfbrm:searches on dates, phone numbers, number . 

• : l^^^iges, proximityi (i:e;. Is X within 5 miles of Y?), field searches and Boolean searches. In 
. ,: • y f '-^dition^ the present invention may be used with other types of queries such as natural 
^atnguage and context-sensitive queries. 
I f; Another embodiment of the present invention includes alternative queries placed into 

' ^'^^^^ example, before the first query is processed, precompiled queries such as those 

"• '^-^-Slp^^ known to take a long time or are particularly timely, can be pre-loaded into the cache 



5^,tb' save time. 



'-^/^^^^A" ^ present invention is also not limited to two taxonomies. Any data collection can 

. 



'! ; by unlimited number of indq>endent taxonomies. Alternative embodiments 

: ■ • a^^ viewing data by company and industry. If a job listing database is 

>;compiled :the jobs can be viewed by job type, the location of the job, the salary, die required 
' - .^^ ^t.l®^ andjf there.are any special interests (i.e. CPA required). 

-1:/ ' present inventionis also not limited to when certain taxonomies are provided to 

■ ; ; is presented with the taxonomy last selected, Thus, if 

4!" ; - * -Location'' taxonomy and enters a new search term, the results will be 

• ; . : \ : l^^l^^^^ "Location" taxonomy described above. However, in an alternative 

' " V v|^^^ system can switch among taxonomies automatically for the user in an effort 

/ , '';|?^f .P^^^®^^^*^ search-results in.a more meaningful manner. For example, if the user selects 
• : :>^5^^fe?fij^^^ the chain, the system will automatically switch over to another 





; ' . ; : taxonomy so as to provide the user with more context and scope regarding the remaining 

■ f' 

!^search results. Thus, if there iare.no sub-categories under "tires," the present invention will 
' ' f ^^'^^^^ the "Location'* taxonomy so that the user can easily determine where the tire 

;%lsa^^ are located. This switching can also be based on the number of hits. If the category 
: ' 5 ; lepntainis only t>yd hits the systemi will automatically switch to the **Location*' taxonomy and 
y; :- ; J thereby . provide the user with the useful information to locate these uvo tire salesmen. 

. rSiirnilarlyi the automatic taxonomy switching may also be based on a particular taxonomy 
V..;; u^A^iwhere the number of categories or sub^categories is small. For instance, providing the user 
v'l 'v^'ii^^^it that all the hit records are located in one category does not provide any 

the user can use to distinguish between these records. Switching to another 
f ?^^^^^ may provide the user with more categories he/she can use to distinguish between 

,,;^.;]^v/;thehitreco 



'.V^'i^if-'^ ^' It will>be appreciated that there is no limit to the depth of the categories andsub- 
; V:' v^vi^^^^^^^^^ will be Appreciated that the present invention can be implanented 

1^5 . in an interface other than the Web. 

It will ftirther be' appreciated that one preferred embodiment of the present invention is 
'' ^^K a collection of data, said System comprising: an organizer configured 

\ t ^^^S^^to receive search requests^ said organizer comprising: a collection of data having at least ^vo 
' / ; .[entries; wherein the collection of data is organized into at least two taxonomies; wherein each 



> io^"^ bf the atleast two taxononiies is associated with at least two categories; wherein the entries 
' flS 'coirespond'to ableastrone of the at least two taxonomies and also correspond to at least one of 
^ Wr'the atlcast two categories; and a search engine in communication with the collection of data, 
V . wherein said search, engine is configured to search based on the at least two taxonomies and 
based on the at least two. categories, wherein the search engine returns, in response to a search 



request identifying at least a first taxonomy of the at least two taxonomies, a list of the 
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l^^icategones associated with the at least first^identified taxonomies, along with the number of 
associated with eaph«of the categories. associated with the at least first identified 
/taxonomies. 



>, , In,a preferred embodiment of the present inventidn, the returned list of categories 

5 associated with the first taxonomy, along with the number of entries associated with each of 
. '^^tegories assp,ciated with.the identified taxonomies can be further searched with regard 

.to a second of the at least two taxonomi^, whereby the search engine returns, in response to a 
'.K search request identifying the second taxonomy of the at least two taxonomies, a list of the 
.categories associated with all identified taxonomies, along with the number of entries 
'^^fer""'^*^ *^ categories associated with the second taxonomy. 



fetooOier pFefenred embodiment, the search engine, having returned, in response to a 
y ^.search request identifying a first taxonomy of the at least two taxonomies, a lUt of the 
' ;<=ftegories asspciat^-with the identified toonomies, along with the number of entries 
^^apociated with^ch pf the categories associated with the identified taxonomies, will provide 
1^ , Vly thos^j categfpes with a non-zero number of entries associated with the identified 



V Wnomi.e5.and,wll:.fi^^^^ return sub-categories both associated with the category and 

Mir 



'\ '|«ihavingi^^pn.zero number of entries associated with the sub-categoiy. 

•?**«'fi'J*^ritoher preferred embodiment, the search eng 
S^"^"^ ^^^-^^^t^gories both associated with the category, and having a non-zen) number of 
^^t!"" ""^"-^ ""'^ sub^ategoty, .will, in response to a search request identifying a 
,N'^second .taxonomyof the at least two taxonomies.provide a list of the categories witha non- 
^ze^o number of entries associated with the at least second identified taxonomies, along with 
the number of entries associated with each of the categories associated with the second 
'IJdenti fied, taxonom ies. 
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; • ^ In another embodiraeiit, the search engine, having returned, in response to a search 
'J^f^uest identifying a first taxonomy of the at least two taxonomies, a list of the categories 
^^associated with the identified taxonomies, along with the number of entries associated with 
^ ^'^li^ch .of the categories associated with the identified taxonomies, will, in response to a string 
;: " s : ;,%'query, provide those entries which both contain the string and are associated with the 

^'f^dehtifieii taxonomies. The string is preferably one member of the group consistmg of text. 



■ ^'')image,' and graphic. 



> ■ . The present invention can be either a network of computers or a single computer. 

inveiition preferably comprises a cache which stores die returned results 



|^®^.^^''thSbpr6senlin: 

search engine for rapid retrieval. 
■ r ^ p/:: There are many preferred taxonomies, includmg at least one taxonomy selected firom 
: v 1^^^^^^^^^^^^^ price, color, size, style, physical characteristics, delivery 

■?|yp||fkhod, manufacturer, brand, components, ingredients, compatibility, warranty infonnation, 
: i^model yeM^ age, and version; the group consisting of products, services, location, industry, 
■''Jii ''business type, SIC code, NAICS code. Harmonized Code, UNSPC Standard, company 
•: ; vill^pin^^ professional information, and degrees attained; the group consisting of organism, 

: process, molecular fimction, and cellular component; flie group consisting of topic, 

7; /Mi' ^aate published, author, country of origin, language, publication name, publication section, 
• .^pV^;;;:^^ security accessibility, jurisdiction, Dewey Decimal identification, statutory 

:\ i'o;;.J..;,;c^ hierarchical management structure taxonomies, and standardized methodologies 

' * ^'^^^ r^for conducting business taxonomies; and the group consisting of company, industry, job type, 
; ySi- llocation^ salary, experience, certifications, benefits, education, minimum performance 
( -p; • c^equirements, and incentives. 

!i|;V In preferred embodiments, the company information is selected firom size, number of 

2|fe / employees, growth, revenues, financial ratios, and business metrics, and the professional 

•ll^^'if 54 




f^information is selected from school attended, memberships, certifications, specialties, areas of 
...iRtactice. 

r /rt||f; ;; ; In^another prcfetred erabpdim^t of the present invention, the present invention will , • 
r. v??^ntresponse to a search request identifying one niember selected from the group consisting of 
• 5 f'la' tsbconomy, a category, and a sub-categpry, the search engine additionally return an 

:;i^4'dy.^sing>entry. Preferably^ the advatising entry is either a banner advertisement, a search- 

■ ^3 r , Various preferred embodiments of the invention have been described in fulfiUmeat of 
. ip. ;;|he, various objects of the invention. It should be recognized that these embodimaits are 
illustratiYe of the principles of the. invention. Numerous modifications and 
: ' l^Cvi^fi^P^-^ be readily apparent to diose skilled in the art without departing from 



/ \^ 'ff^^^^ inv«ition. 
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■■'M^S^'ii CLAIMS 



. i^^;"^^^^^^^^^^ . A system for searching a collection of data, said system comprising: 



y-'-'MH-i an organizer configured to receive search' requests, said organize comprising: 
^,a;<?pl^ection^of diata having:at least two entries; 

' '''^^^•^^W'^' .^fiwherein Jhe collection of data is organized into at least two taxonomies; 

' V h>:^^^^^ whereineach of the at least ^vo taxonomies is associated with at least two categories; 

: , . '''-^ '-'^K^ ^''^^'■ 
'}'^'p'':■:'•'f: wherein the entries correspond to at least one of the at least two taxonomies and also 

v'jy^^^ at least two categories; and 



a search^engine in communication with the collection of data^ 



. '•''i^^^^:'- • . "VYherein^said:search engineis configured to .search based on the at least two 

^i^^t^^^^^:-: iWherein; the search engine returns, in response to a search request identifying at least a 
't: yi^^^^^^^^^^^^^^^^^ taxonomies^^a list'of the categories associated with (he at 

3' V^StfP^llist first; identified taxonomies, along with the number of entries associated with each of the 

-CyjiS^^ill^ associated with the at least first identified taxonomies. 




The system according to Claim 1, wherein the returned list of categories associated 




Ipcohd iOf the ia t least two taxonomies, whereby the search engine returns, in response to a 
< ■ l^a^ identifying the second taxonomy of the . at least two taxonomies, a list of the 

.il^:^ ,;i||;;^^^ptegQries associated with all identified taxonomies, along with the number of entries 
V of the categories associated with all identified. taxonomies, 
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• 3.' The system according to Claim 1, wherein the search engine, having returned, in 
v^iresponse to a search request identifying at least a first taxonomy of the at least two 



• vn^taxonomies, a Kst of the categories associated with the identified taxonomies, along with the 
associated with each of the categories associated >vith the identified 
' '5,^'Mtaxonomics. will provide only those categpries with a non-zeio number of entries associated 
• -te-with the identified taxonomies and. will furtherretum sub-categories both associated with the 
, >;category;and havinjg a non-zero number of entries associated with the sub-category. 



: }. . ;;f f., The system according to Claim 3, wherein the search engine^ having further returned 



■ 'i^ops^^categories both associated with the category and having a non-zera number of entries 

•V "^t^.a^sociated with the subrcategory, will, in response to a search request identifying at least 



a 

anon- 



- second taxonomy of the at least two taxonomies, provide a list of the categories with 

• „ -1^1 zero numberof entries associated with the at least second identified taxonomies, along with 

; J)ftj^^'^"'"^.^'0^^ of the categories associated with the at least 

, 1 5'(,7s^ond4dentifi€d taxonomies. 



In systera according to Claim 1, wherein the search engine, having returned, in 

s^reh request^identifying at least a fiist taxonomy of the at least two 
■' -r, '• .pf^xbnomies, a list of the categories associated with the identified taxonomies, along widi the 
■ ■. '20,f| :number of entries associated with each of the categories associated with the identified 

4^^,'^°'^0'"i^>^'l>» i'i response ;to a string query, provide those entries which both contain the 
' and-jare associated' with die-identified taxonomies. 



;^,^6. The system according to Claim 5, wherein the string is one member of the group 
:^25;|fconsisting of text, image, and graphic. ' 
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- iSr" "^^^^'^ *"*>««»g to GUim^l. wherdn the system comprises a 



5 "8. 



:j;:«4. ^; "^^ system according, to Claim 1. wherein the system further comprises a cache which 
■ -ii>.stores the retumed results of the search engine for rapid retrieval. 

• . • . ■ . . 

The system for searching a coUection of data according to Claim 1, wherein at least 
</i|||e taxonomy of-the at least two taxonomies is selected from the group consisting of product 
, . price, color,:size. style, physical characteristics, delivery meAod. manufecturer, brand. 

|;Vcpmpoi^nls, ingredients, compatibility, warrantyinfonnation, model year, age, and ve^ 

15^ 1 1>. The system for searching a collection of data according to Qaim 1, wherein at least 
l'-^^. taxonomy of the at least two taxonomies is selected from the group consisting of 

W ''^''^''^^ type. SIC code. NAICS code, Harmonized . 

^;^Eode,MPC Standard,*o^^ p^fessional information, and degrees 



The system according to Claim 1, wherein the system comprises a single computer. 



" ' r—-,~^ — ~~^*«» F'"*w»=>'""«i inionnanon, ana degrees 

* attained. 

I I 

,^12. The system for searching a collectionofdata according to Ck^^ " 
^;;fwy information is^ 

. ■ ;t number of employees, growth, revenues, financial ratios, and business metrics. 



* ' ' ' fwr^' * '^^^ ^y^^ for searching a collection of data according to Qaim 1 1 , wherein the 
l^prpfe information is at least one: characteristic selected from the group consisting of 
. : : school attended, manberships, certifications, specialties, areas of practice. 



SP^^ system forsearching a collection of da^^ 1, wherein at least 

>;--:;-;^^^ne of the at least two taxonomies is selected from the group consisting of 

: ''^^^^ molecular function, species, and cellular component. 

v'w; ■ r|;;|;5 . The system for searching a collection of data according to Claim 1, wherein at least 
' is selected from the group consisting of topic, 

r : '^^^^1^^^^ published^ author; country of origin, language, publication name, publication section, 
:'' jurisdiction, Dewey Decimal identification, statutory 

■ • 1^ ;0dification, hierarchical management structure taxonomies, and standardized methodologies 



; :;|^J^^ip^^o^^ business taxonomies. 



• ' V jI v '^^^^^^^^^^^ The system for searching a collection of data according to Claim 1, wherein at least 

taxonomy of flie at least two taxonomies is selected from the group consisting of 
! !#t^w^^Pa^» industry, job typev location, salary, experience, certifications, benefits, education. 



' ■ ' performance requirements, and incentives. 

' The system for searching a collection of data according to Claim 1 , wherein, in 

■ • {-f response to a search request identifying one member selected from the group consisting of a 
' * • ' a category, and a sub-category, the search engine additionally returns an 
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^"18. ""^^^y^t^^for seaiching a collection of data acconlm^ 
advertising entry is at least one member sdected to the gn>upcons^^^^ . 
advertisement, search-visible storefront, and text-searchable advertising. 



• "J^ ? '19. A system for searching a collection of data, said system comprising: 
means for networking a plurality of computers; and 
7, V V ' ^^'^ executing in said computer network and configured to receive 

. ,.?^;earch requests.frorr, any one of said. plurality of computers, said means fororganizing 

'L 'l::l^rcotnprising: 

, 10 ' , , a collation of data having at least two entries; 
. ; i] wherein.the collection of data is organized into at least two taxonomies; 

wherein each of the at least two^taxonomies is associated with at least two categori 
wherein.the entries correspond to at leastone of the at least two taxonomies and also 
. ^kcorresporid to at least one of the at least two categories; and 

m^ns for searching in communication with thecollection of data 
; ■ ^^^^i'^.^aidWs for searching is configured to search 
, , j ^"^^9 and based on the at [ea?t two categories, 
,^ >^«}«reipthemear^forsearclungretur^^ 
?*least one taxonomy of the at least two taxonomies, a list of the categories associated with the 
20; ^identifiedtaxononiies, along :yith the number of entries associated witheach of the categories 
associated with die identified taxonomies. 



'J 20; The system according to Claim 19. wherein the .^turned list of categories associated 
;;with the first taxonomy, along with the number of entries associated with each of the 
25 : ■ categories associated with the at least identified taxonomies can be further searched with 



>.^^pj?gaKHo atlfeastasecondtaxonomyoftheatleast two taxonomies, whereby the means for 

, , A| searching returns, in response to a search request identifying the at least second taxonomies of 
5^ 14*^ ^ t«onomies..a list of the categories associated with all identified taxonomies. 



^*ong with the mimber of entries associated with each of the categories associated with the at 



:<V-4tg|p!ast-second taxonomies. 



^I^R^ The systemaccoiding to Ckim 19, wherein the means for searching, having returned, 
. ■ l^gfesponse to- a^seaith request identifying al least a first 'taxonomy of the at least two 
>g||gc6nomies, a list of the categories associated with the identified taxonomies, along with the 
, IP^ j,fliimjjer of entries associated with each of the categories associated with the identified 
^§§MM^^' P"'^'^^**'^>' with a non-zero number of entries associated 

and will farther provide sub-categories associated with the 
S^^fe^*^'^^'"^ a-non-zero mmiber of entries associated with the sub-category. 



'•, -.''woisii*' ■.,v• 
^'?»^"Si9p^P'■'• 



i 'S^Mr ^""''"''''^ '° "^^^'^^ ^ ^^^^ s^Tching. having further 

• ^' ''^l^te ^'"^^^ ^'^^ ^^^8^^ having a non-zero number of 

> SJ^^'' subcategory, will, in response to a search request identifying at 

r of the at least two taxonomies, provide a list of the categories with, 

i t ' M g , V?^n.2ero number of entries ^sociated with the at least second identified taxonomies, along 
■4 ?i:|^p^^''"'^^ °f *^ ^^tegories associated with the at least 



identified taxonomies. 



^ a list of the categories associated with the identified taxonomies, along with the 
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' '^mj^onom^, will, in respoTKe te a string queiy,provide those entries which both contain the 
.//Spring and are associated with the identified taxonomies. 



' "^^"^ according to Claim 23, wheiein the string is one member of the group 

' ^^fe?!^°^'^^"S'oftextiimage,:andgraphic. 



, ;||;^2^. 'niesystemaccordingtoaaim..l9.whereinthesystemcomprisesanetworkof 

/ ;V^c'ornputers. 

^ r^^^ The system according to Claim 19, wherein the system comprises a single computer. 

' -'^Ifi;?'. The system according to Claim 19, wherein the system further comprises a cache 
-^f^i^'^^ the returned results of the means for searching for rapid retrieval. 

^^^^y^^'" f°'^^'^hi'^8 a collection of data according to Claim,19, wherein at least 
, |<ipne taxonomy of theatleasttwotaxonomies is selected from the group consisting of product 
^, ^e,,Rrice, color, size, style, .physical characteristics, delivery method, manufacturer, brand. 
' i^&^^ts. compatibility, warranty information, model year. age. and version. 

/ f^*^?. Ihe^system forsearching a coUection of data according to Claim 19. wherein at least 
taxonomyof the.at.least twptaxonomies is selected from the group consisting of 

ilft^'*"''''''^'^'^' '"^^^^^^ ^^^^ Hannonized 

fm^^> UNSPC standard, company information, professional information, and degrees 



25^'^^ ^attained, 
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V ' ^'^^ system for searching a collection of data according to Claim 29, wherein the 

..... . , • " 

; '^^Fi^cpmpany information is at least one charactenstic selected from the group consisting of size, 
1^: number of employees, growth, revenues, financial ratios, and business metrics, 

>^>;;2^^^^ • ■ •' ■ The system for searching a collection of data according to Claim 29, wha^in the 
^ li> 5^ information is at least one characteristic selected from the group consisting of 

• memberships, certifications, specialties, areas of practice. 

: • The syst^ for searching a collection of data according to Qaim 1 9, wherein at least 

' S "^^i^^ two taxonomies is selected from die group consisting of 

; ^ $»|c*gamsm, biological process, molecular function, species, and celhilar component. 



Wii^^m^'' The system for searching a collection of data according to Claim 19, wherein at least 
'■^^^M^ taxonomy of the at least two taxonomies is. selected fit)m the group consisting of topic, 
; i VOT^^ of origin, language, publication name, pubhcation section, 



,v jurisdiction, Dewey Decimal identification, statutory 



• : - hierarchical management structure taxonomies, and standardized methodologies 

i .%%^^|for;.c<m 



■?^^iS: o^he systems for- searching a collection of data according to Claim 19, wherein at least 



• • 4if ^iSPP taxoTOmy oCfherat le^ taxonomies is selected from the group consisting of 
* J^si?^'^^"'^* ^pdustry, job, type, location, salary, experience, certifications, benefits, education, 



■ '^i^^^ performance:requirements,,and incentives. 
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, ; , 35. The system for searching a collection of data according to Claim 19, wherein, in 
V*^^^ response to a search request identifying one member selected fiom the group consisting of a 
.,^i^%4axonomy, a category, and a sub-category, the means for searching additionally returns an 
^yadycrtising entcy. • . . 




: 36, The system for searching a collection of data according to Claim 35, wherein the 
advertising entry is at least one member selected from the group consisting of a banner 
^ ' Mp^^'''^^^^^^^^ ^ search^visible storefront, and text-searchable advertising. 



t ^ffi^P' A method for searching a collection of data, said method comprising: 

cpmmunicating, a search request to a search engine, the search engine being in 

w,,vsj:v]^^ommun^ 

....... . y 

wherein the collection of date has at least two entries; 
wherein the collection of data is organized into at least two taxonomies; 
vy:herein each of the at least two taxonomies is associated with at least two categories; 
wherein the at least two entries correspond to at least one of the at least two 
' yWf^^'^^ to at least one of the at least two categories; 

C^V^' . ^^^^^8. oflhe collection of data by the search engine ba 
search request; 

' .wherein the communieated search request identifies at least one of the at least two 



' ^^'hlj^^taxonomies;. 

returning of a list of the categories assopiated with the at least one identified 
^taxonomies, along with the number of entries associated with each of the categories 
, -associated with the at least one identified taxonomies as a response to the querying of the 
25, ^collection of data. 





' ; 38 . The method for searching a collection of data according to Claim 37, wherein the 

/u..;imethodvfurther comprises 

\t^^v i; returning, in response to a search request identifying iat least a second taxonomy of the 
^^Si^if at least two taxonomies, a list of the categories associated ^th all identified taxonomies, 
\ - ' along with the;nuraber of entries associated with each of the categories associated with the at 
^^ least second taxonomy: 

^' \ 39. The method for searching a collection of data according to Claim 37, wherein die 

ymethod further comprises 
' / ' • retuming.a list of only those categories with a non-zero number of entries associated 



t^4?^^\ w taxonomies and forther returning at least one siib-cat^ory associated with 

## • 

i^!^'!;!!'^^® category and having a non-zero number of entries associated with the sub-category. 



^ ^40- The method for searching a collection of data according to Claim 39, wherein the 

A, Iff ;niethod further comprises 

^^mfC'" having further returned sub-categories both associated with the category and having a 
/ : non-zero number of entries associated with the sub-category, providing, in response to a 
{search request identifying at least a second taxonomy of the at least two taxonomies, provide 
4anist ofethe categories with a non^zero number of entries associated with the at least second 
r^^Kl'i^i'entified taxonomies, along with the number of entries associated with each of the categories 



associated with the at least second identified taxonomies. 



41 . The method for searching a collection of data according to Claim 37, wherein the 
;!2^ > . method ftxrther comprises 
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returning, in response to a string queiy, provide thoseentries which botJi contain the 
Sh^^^*^^!^ and are associated with the identified taxonomies. 



42. The method for searching a collection of data according to ^^^^ 
# ; of text, image, and graphic. 



':0\^¥^'S^' Th^ method for searching a collection of data according to Claim 37, wherein the 
J ; > system compnses a network of computers. 



'iy!^Mt'^' method for searching a collection of data according to Claim 37, wherein the 

^- S^ilS?^ comprises a single computer. 



V vliffl 'f^* The method for searching a collection of data according to Claim 37, wherein the 
fil'ff^^^^ further comprises a cache which stores the returned results of the means for searching 

■'fif%ff^:, .. , ^ HicUiod for searcshing a c6llecti<5ttof data according to Qaim 37, wherein at least 
taxonomy of^e at least two taxonomies is selected from the group consisting of product 
tS^^^^P®' P^*^ colors size^ style, physical chaiacteristics, delivery mettiod, manufecturer, brand, 
• \ compatibility, warranty information, model year, age, and version. 



:. 46. The method for searching a collection of data according to Claim 37, wherein at least 

is selected from the group consisting of 
W- l|f ^i9#cts, servicesi location, industry^ business type, SIC code, NAICS code. Harmonized 
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Standard, company. information, professional information, and degrees 

attained. • 

^ 47. The method for searching a collection of data according to Claim 46, wherein the 
M V company information is at least one characfceristic selected from the group consisting of size, 
number of employees, growth, revenues, financial ratios, and business metrics. 

48. The method for searching a collection of data according to Claim 46, wherein the 
, J J professional, information is at least one characteristic selected from the group consisting of 
tlO^ ^^school attended, memberships, certifications, specialties, areas of practice. 



49. The method for searching a collection of data according to Claim 37, wherein at least 



r ^ one taxonomy, pf the at least two taxonomies is selected from the group consisting of 
' y ^^g^'^isrp* biological process, molecular function, species, and cellular component. 



^ ' 50, The method for searching a collection of data according to Qaim 37, wherein at least 

^ ^^'^ 

vone taxonomy of the at least two taxonomies is selected from the group consisting of topic, 
t «V??*^^clatc,pufeiished, wthor, country of origin, language, publication name, publication section, 
. , industry,>.securi.ty accessibility, jurisdiction, Dewey Decimal identification, statutory 
^,|<^*fi<^at:ion, hierarchical management structure taxonomies, and standardized methodologies 
) / \ for conducting business taxonomies. 



5 K The mQtho4.for searching a collection of data according to Claim 37, wherein at least 
, one taxonomy of the at least two taxonomies is selected from the group consisting of 
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, (Company, industry/job type, location, salary, experience, certifications, benefits, education. 



minimum performance requirements, and incentives. 



\/ • ■ * 

* ^52. The method for searching a collection of data according to Claim 37, wherein the 
%'^^5iif ;jmethodfiirflier comprises 

returning by the search engine additionally, in response to a search request identifying 
^ one member/Selected from the group consisting of a taxonomy, a category, and a sub- 

.... 

V # category, an advertismg entry. 



l4 ^® method for searching a collection of data according to Claim 52, wherein the 

. 1 1' advertising . entry is at least one member selected from the group consisting of a banner 
ht ^l^^^'^l^dxertisementi^ advertising. 

, 54. An article . of maiiufacture comprising: 

' I = a;^omputer usable medium having computer program code means embodied thereon 
l^fl^/ f^^ searchinga collection of data, the computer readable program code means in said article 

lOfmantifecture.c.oniprising: 
^ ^ ^1 ' computer readable program code means for communicating a search request to a 



0 



search engine, the search engine being in communication with a collection of data; 

-wherein the collection of data has at least two entries; 
n- wherein the collection of data is organized into at least two taxonomies; 

wherein each of the at least two taxonomies is associated with at least two categories; 
xyherein the at least Uvq entries correspond to at least one of the at least wo 



^ taxonomies.and also correspond to at least one of the at least two categories; 
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' ^^^^^ computer readable program code.raeans for querying of the collection of data by the 
' ;v' ^ search engine based on the communicated search request; 

wherein a communicated search request identifies at least one of the at least two 
taxonomies: ^acid 



% ' , computer readable program code means for returning of a list of the categories 

¥ ^associatedwith the.at leastpne identified taxonomies, along with the number of entries 
, ' V ^ .associated with each of the categories associated with the at least one identified taxonomies 
^ , as^a response to the querying of the collection of data. 



1'0? ' ' 55, The article of manufacture according to Claim 54, wherein the returned list of 



't'^ \ ^ 'categories associated with the at least first taxonomy, along with the number of entries 
■ ' associated with each of the categories , associated with the identified taxonomies can be further 

' ^^^^ I searched with regard to a second of the at least two taxonomies, whereby the computer 
; ^#i.fj*readable program code means for querying of the collection of data by the search engine 

Yfx^ ! returns, ,in response to a. search request identifying the at least second taxonomies of the at 

«>•".. 

^ least two taxonomies, a lis t of the categories associated with both identified taxonomies, 
along with the number of entries associated with each of the categories associated with the at 



1 ' 



, ^!}^.[] least second :taxoriomies.. 



' 56. The article of manufacture according to Claim 54, wherein the computer readable 
program code means for querying of the collection of data by the search engine, having 
, ^ return^ in response^to a search request identifying at least a first taxonomy of the at least 



two taxonomies, a list of the categories associated with the identified taxonomies, along with 

the number of entries associated with each of die categories associated with the identified 

25^ taxonomies, will provide only those categories with a non-zero number of entries associated 

69 




[rr, with the identified taxonomies and will further provide sub-categories associated with the 
^ ' category and having a 'lionrzero number of entries associated with the sub-category. 

''If flurticle of manufacture according-to Claim 56, wherein the computer readable 

program code means for querying of the collection of data by the search engine, having 
. further returned sub-eategories both associated with the category and having a non-zero 

V ' ' number of entries associated with the sub-category, will, in response to a search request 

' ' %' . 

I '^^^J ' * identifying at least a second taxonomy of the at least two taxonomies, provide a list of the 
u H , categories with: a non-zpro number of entries associated with the at least second identified 



bo; ; taxonomies, along with the number of entries associated with each of the categories 



4' ^associated with the at least second identified taxonomies. 




' ^58. The article ofmanufacture according to Claim 54, wherein the means for s^^ 

. . ' 1 ' having returned, in response to a search request identifying at least a first taxonomy of the at 
' i ^ll^ taxonomies, a list of the categories associated with the identified taxonomies, along 

K W*^?' "^i* the number of entries associated with each of the categories associated with the 

\' ^ ■^^'^^^^^^^ response to a.string query, provide those entries which both, 

i- ■■ . 

^bntain the sfring^and are associated .with the identified taxonomies. 



[ , 59.; ^. ./ITie article of n^ according to Claim 58, wherein the string is one member 



' . Ih^ • ^HS^^P PPnsisting of text, irnage, and graphic. 



\X > article of manufacture according to Claim 54, wherein at least one taxonomy of 

V the at least two taxonomies is selectjed from the group consisting of product type, price, color, 
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:S ■ ^' <^h^^*«"sUes. delivery method, manufacturer, brand, components, 

ll^?"^^'^"''" "^""P^^^^^' warranty information, model year. age. and version. 

^1^^461. >The.article:ofmanu&cture according to Claim 54, wherein at least one taxonomy of 
: . Jff - ; the at least two taxonomies is selected from the group consisting of products, services. 
0r; )ocat.oni industry/business type,- .SIC code, NAICS code, Harmonized Code UNSPC 



^l^^^tandaiAcompany information, pr^^ information, and degrees attained. 



. . .^.Wf : -62. The article of manufacture according toClaim 61. wherein the company information is 
Jl^^tleast one characteristic selected from the group consisting of size, number of employees, 
. V .m.««^K ^ financial ratios, and business metrics. 



'^fe|li'#owth, revenues, 



f ^P" ^ '^^'*^'''*''^'"'"'^'^*^*^"ii«gtoaaim61,whereinfc^ 



'i^^l^ one characteristic selected from the group consisting of school 



I^^S^^ '^^^^ips.o^rtifLcatiotis, specialties, areas of practice 



■ -If.. 



'^rhM^' ^'*'^''^'°^'"^^*'^'^'^'^'^^'"8toClaim54.wherdnatleas^ 
. ,,„|i,,|heat leasttwo taxonomies. is selected from thegroup consisting of topic, date published, 
|^^ti.or..co«ntry of origin, language, publication name, publication section, industty, security 
l^l^sibflity, jurisdiction, Dewey Decimal. identification, statutory codification, hierarchical 
V '^^fmm^ structure taxonomies, and standardized methodologies for conducting business 
^- \\;V|^VV'3taxonomies. 

pil^, --^^^ ■■ • • ■ . - . . 

- " The article of manufecture according to- Claim 54, wherein at least one taxonomy of 

J^^^^aMeast two taxonomies is selected from Ae group consisting of company, industry, job 
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type, location, «lary. experience, certifications, benefits, education. n.ini.un, perfonnance 

requirements, and incentives. 



, W,; ^■n. artole of m.„uiao,„„ according ^ QaiM H »h«i„ a. leas, one taom>»y of 
% *e a. acas,«vo «xo„on,i« is «I««d fi.„„ g„>up comiaing of o,g.d™. biologica, 
j process, molecular function, spoeies, arid cellular componenl. 



M;^. f ««^i^l^ofnianufectureaccordi„gtoCIaim54.wherd^^ 



»7.: 
V 

'-''J:^ ^"'requestiidentifinng one member selected fr^^ ^ _ 

[entry. 



group consisting of a taxonomy, a category, 
' and a sub-category, the search engine additionally returns an advertising < 



r 1 ^) • 

:||f- J''^^^^^'^'^^f™-"^<=^-CIaim67.whereintheadvertis^ 
;|f ..in,emb^.select^ftem the group c^^^^^^^ 

'l^vl^ storefront,, and text-searchable advertisine 
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